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Derivative Nucleic Acids and Uses Thereof 

This application claims priority from provisional patent application 60/143,804, 
filed July 14, 1999, which is hereby incorporated by reference. 

Field of Invention 

This invention relates to the field of analyzing complex mixtures of nucleic acids 
for the presence of specific sequences and for the presence of specific sequence 
variations. 

Background 

Methods of the invention can be used to analyze complex mixtures of nucleic 
acids in small amounts of source material. Common applications are the quantification of 
messenger RNA levels for specific genes in an organism or tissue or the determination of 
the allele status of genetic polymorphisms in genomic DNA. Often there is a requirement 
to analyze many different nucleic acid sequences in a single sample, and it is often 
necessary to amphfy the target nucleic acids or signal molecules to obtain detectable 
signals in an assay. 

DNA microarrays provide the ability to analyze many target sequences in a 
sample. However, conventional microarray analysis is limited by the cost and difficulty 
of preparing large numbers of target molecules from limited amounts of sample and by 
poor hybridization specificity. Multiplexed analysis - amplifying target sequences or 
target-specific probe molecules from many targets in a single sample - can minimize the 
cost of reagents and the consumption of precious samples. Methods with better sequence 
specificity than simple hybridization reactions - e.g., analytical methods based on 
activities of nucleic acid modifying enzymes - provide improved reliability and accuracy 
of the quantification of specific sequences and the detection of specific polymorphisms. 




Multiplex polymerase chain reactions (PGR) have been used to decrease the 
sample preparation for scoring genetic variation by hybridization on DNA microarrays. 
Unfortunately many of the target sequences fail to ampUfy optimally, especially when 
large multiplex factors (large numbers of target sequences amplified simultaneously) are 
used (Wang et al, Science 280 :1077, 1998). Multiplex PGR has been performed with 5' 
extensions or "tails" on target-specific PGR primers which incorporate universal priming 
sequences into amplification products (Brownie et al, Nucleic Acids Research 25 :3235, 
1997; U.S. patent 5,858,989; Jeffreys et al; Favis et al, Nature Biotechnology 18 :561, 
2000). Amplification is performed for a limited number of cycles with these tailed target- 
specific primers, and then fiirther amplification is performed with high concentrations of 
universal primers with the same sequence as the 5' tails of the target-specific primers. 

Multiplex amplification with universal primers after appending 5' extensions on 
amplification products with adapter primers has been used with strand displacement 
amplification (US Patent 5,422,252, Walker et al.). Generic or universal primers have 
been used to amphfy ligatable probes (US patent 5,876,924, Zhang et al.; Thomas et al.. 
Arch Pathol Lab Med 123 :1 170, 1999). The use of generic primers for probe 
amplification is advantageous compared to convention target amplification with PGR, 
since primer binding and amplification is not subject to the variability of target 
sequences. The ligatable probes may be pairs of linear probes or a single circularizable 
probe. In the case of pairs of linear probes the generic primer sequences are incorporated 
into the 3' and 5' ends of the probes, respectively (US patent 5,876,924, Zhang et al.). In 
the case of circularizable probes, generic primer sequences are incorporated into the 
linker region between the target-specific termini of the probe. Amplification of 
circularizable probes may be performed with a single generic primer ("rolling circle 
amplification", RGA) or with a pair of generic PGR primers (US patent 5,876,924, Zhang 
et al.; Thomas et al.. Arch Pathol Lab Med 123 :1 170, 1999). Amplified circularizable 
probes are attractive altematives to PGR for multiplexed analysis (Isaksson and 
Landegren, Gurr Opin Biotechnol. 10 : 1 1, 1999). Ligation reaction provide excellent 
sequence discrimination for scoring SNPs (Thomas et al. Arch Pathol Lab Med 
123:1170, 1999; Favis et al, Nature Biotechnology 18 :561, 2000). 
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Another type of reaction that provides excellent sequence discrimination and that 
also provides amplification of the probes is the Invader assay developed by the Third 
Wave Technologies Company (US patent 5,846,717; US patent 5,888,780; US patent 
5,985,557; US patent 5,994,069; US patent 6,001,567). With this method pairs of probes 

5 are hybridized to nucleic acid targets, and an endonuclease enzyme effects cyclic 
structure-dependent cleavage of one of the probes, if the probe matches the target 
sequence. A fragment of arbitrary sequence with a ligatable 3' terminus ("flap") is 
cleaved from the probe by the enzyme. The reaction provides substantial amplification 
and high specificity for scoring of SNPs (Ryan et al., Mol Diagn 4 :135, 1999, Griffin et 

10 al, Proc Natl Acad Sci U S A 96 :6301, 1999) and has great potential for multiplexing the 
analysis of many targets in a single sample (Griffin et al., Proc Natl Acad Sci U S A 
96:6301, 1999). 

Capture of target sequences on microarrays of nucleic acid probes is limited by 
less than optimal discrimination between related but non-identical sequences and by the 

1 5 secondary structure of target molecules. Analysis of single nucleotide polymorphisms 
(SNPs) by hybridization on oligonucleotide arrays is error prone and requires the use of 
highly redundant sets of probes (Wang et al., Science 280 :1077, 1998; Cargill et al.. 
Nature Genetics 22 : 231, 1999). A method of enhanced oligonucleotide capture uses 
partially duplex probes and enzymatic reactions (ligation and/or primer extension) to 

20 accurately discriminate perfectly matched targets from those containing mismatches 
(Broude et al, Proc. Natl. Acad. Sci. 91 : 3072, 1994; US patent 5,503,980, Cantor; 
Gunderson et al.. Genome Research 8 :1 142, 1998). Analysis of the efficiency of 
hybridization of various nucleic acid sequences (Mir and Southern, Nature Biotechnology 
17:788,1999) has demonstrated that the hybridization is most efficient at single stranded 

25 regions near duplex stems, probably due to the presence of helical order and the absence 
of tertiary interactions. Ligation and polymerase reactions have been shown to provide 
excellent discrimination of perfectly and mismatched probes required for the scoring of 
SNPs (Thomas et al., Arch Pathol Lab Med 123 :1 170, 1999; Favis et al. Nature 
Biotechnology 18 :561, 2000; Pastinen, et al., Genome Research 7 :606, 1997). Thus 

30 capture of nucleic acid targets on partially duplex probes coupled with enzymatic 
discrimination provides for very high specificity. The probes capture only terminal 



G:\DATA\PLM\PKP\application.doc 



sequences on target molecules; internal target sequences that match the single stranded 
overhang sequences of the probes are not captured. 

Summary of the Invention 

In one aspect, the invention features, a method for multiplexed analysis of a 
plurality of target nucleic acid sequences in a sample. The method provides a derivative 
nucleic acid for each target sequence analyzed and present in the sample. A derivative 
nucleic acid is a nucleic acid which includes a capture tag sequence at one, or both, of its 
3' and 5' termini. The derivative sequence is produced only upon the hybridization of a 
probe or primer which includes the sequence tag as an internal fragment to the target 
sequence for which it is specific. The sequence tag is analyzed, e.g., by its ability to bind 
to a capture probe, e.g., a capture probe bound to a insoluble substrate, e.g., a bead, or an 
ordered array of partially duplex capture probes. The presence of the derivative nucleic 
acid, and its characteristic terminal sequence tag, is diagnostic of the presence of a 
selected target sequence. The method can evaluate, e.g., identify or quantitate, the 
presence of one, or preferably a plurality, of specific nucleic acid sequences, or the 
presence of specific sequence variations. The method includes the steps of: 

providing, for each target nucleic acid sequence to be analyzed, at least one 
probe/primer molecule which probe/primer molecule includes a region of sequence 
substantially complementary to a sequence in the target nucleic acid sequence and a 
region that is not located at either terminus of the probe/primer and which includes a 
capture tag sequence; 

forming a reaction mixture which includes the probe/primer molecules and the 
target sequences under conditions such that, if a probe/primer molecule specific for a 
target sequence and that target sequence are both present, one or a plurality of derivative 
molecules having a capture tag at one or both its 3' or 5* termini, of the probe specific for 
the target sequence, is generated, thereby producing a derivative nucleic acid suitable for 
evaluation; 

optionally, evaluating the presence of one or more sequence tags, e.g., by 
capturing the derivative nucleic acid molecules by hybridizing the tag sequences to 
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complementary single stranded overhangs on partially duplex probes, which are 
preferably spatially separated, (e.g., on beads or on an ordered array) such that the tag 
sequences are contiguous with the double stranded regions of the partially duplex probes 
and thereby analyzing said captured derivative nucleic acid molecules. In preferred 
embodiments the captured derivative nucleic acids are further analyzed, e.g., SNP's or 
other sequence events are evaluated, e.g., by a further reaction, e.g., an extension 
reaction. 

In a particularly preferred embodiment the derivative nucleic acid is ligated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
washed, to achieve highly specific capture. 

In a preferred embodiment a partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety on one of the molecules that 
allows for the immobihzation and spatial separation of the probe molecules. 

In a preferred embodiment the partially duplex probes consists of a pair of 
molecules which are bound by covalent means to form a structure with either a 3' or 5' 
overhang and which contains a chemical moiety on one of the molecules that allows for 
the immobihzation and spatial separation of the probe molecules. 

In another aspect, the invention features, a method of providing a derivative 
nucleic acid having single-strand overhang suitable for analysis. The method includes: 

providing a first and second primer, 

wherein said first primer includes, preferably in the order of 5' to 3', 
a first region which includes a universal primer sequence (in a 
preferred embodiment the universal primer sequence is different for each primer of a pair, 
but the same pair is used for all targets) (occasionally referred to herein as Uf); 

a second region which includes a capture tag sequence and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme sequence 
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(occasionally referred to herein as R/T), and 

a third region which can which can hybridize to a first region on 
the target nucleic acid sequence (occasionally referred to herein as Xf), 
wherein said second primer includes, preferably in the order of 5' to 3', 

a first region which includes a universal primer sequence 
(occasionally referred to herein as Ur); 

a second region which includes a capture tag sequence (the capture 
tag sequence on the second primer can be the same or different, and is 
preferably of different sequence, fi:om that of the capture tag of the first 
primer) and a cleavage site, e.g., a site for cleavage by a restriction enzyme 
sequence (occasionally referred to herein as R/T2), and 

a third region which can which can hybridize to a second region on the 
target nucleic acid sequence (occasionally referred to herein as Xr), 

forming a reaction mixture which includes the first and second primers and the 
target nucleic acid, and using the target as a template, extending the primers along the 
target nucleic acid, to produce an extended target strand, which preferably includes, in 
order, a universal primer sequence, a capture tag sequence, target sequence, a capture tag 
sequence, and a universal primer sequence; 

optionally amplifying, using the first and second primers, the extended target 

strand; 

contacting an extended target strand with a universal primer which binds to the 
universal primer sequence (and preferably not to regions on the amplified target strand 
other than the universal primer sequence) and extending the universal primers along the 
extended target strand to synthesize a extended target strand which includes, in order, a 
universal primer sequence, a capture tag sequence, target sequence, a capture tag 
sequence, and a universal primer sequence; 



G:\DATA\PLM\PKP\application.doc 




optionally amplifying, using universal primers, the extended target stranded 
species; 

cleaving at the cleavage site of one or both ends of a double stranded extended 
target molecule to provide a derivative nucleic acid which includes a double stranded 
molecule having an overhang which includes the capture tag sequence, preferably at one 
or both of the 3' and 5 'termini, 

thereby producing a target molecule having an overhang. 

Methods of the invention allow for multiplexed reactions, e.g., reactions in which 
two or more targets, and preferably as many as 10, 50, 100, 200, 500, 1000, or 5000, are 
analyzed. Targets can be on the same molecule or can be on different molecules. Thus, 
in a preferred embodiment the reaction mix includes a second target, and a third and a 
forth primer are included in the reaction mix, 

wherein the third primer includes, preferably in the order of 5' to 3', 

a first region which includes a universal primer sequence (in a 
preferred embodiment the universal primer sequence is different for each 
primer pair, but the same pair is used for all targets), 

a second region which includes a capture tag sequence (which is 
preferably different from the capture tag sequence on one or both of the 
first and second primers) and a cleavage site, e.g., a cleavage site for 
cleavage by a restriction enzyme, and 

a third region which can hybridize to a first region on the second 

target, 

wherein the fourth primer includes, preferably in the order of 5' to 3', 
a first region which includes a universal primer sequence; 
a second region which includes a capture tag sequence (which is 
preferably different form the capture tag sequence on one or both of the 
first and second primer, and which can be the same or different, and is 
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preferably of different sequence, from that of the capture tag of the third 
primer) and a cleavage site, e.g., a site for cleavage by a restriction 
enzyme, and 

a third region which can which can hybridize to a second region on 
the second target, 

forming a reaction mixture which includes the first, second, third and fourth 
primers and the two targets, and using the second target as a template, extending the third 
and fourth primers along the second target, to produce an extended second target strand, 
which preferably includes, in order, a universal primer sequence, a capture tag sequence, 
target sequence, a capture tag sequence, and a universal primer sequence (the extended 
second target will include a capture tag sequence which is different from that on the 
extended first target); 

optionally amplifying, using the third and fourth primers, the extended second 
target strand produced in the step above; 

contacting an extended second target strand with a universal primer which bind to 
the universal primer sequence (and preferably not to regions on the amplified second 
target strand other than the universal primer sequence) and extending the universal 
primers along the extended second target strand to synthesize a extended second target 
strand which includes, in order, a universal primer sequence, a capture tag sequence, 
target sequence, a capture tag sequence, and a universal primer sequence (wherein at least 
one capture tag is different from a capture tag on the first extended target); 

optionally ampUfying, using universal primers, the extended second target strand 
species; 

cleaving at the cleavage site of one or both ends of a double stranded second 
extended target molecule to provide a double stranded molecule having an overhang 
which include a capture tag sequence, at one or both of the 3' and 5' terminus, preferably 
at the 3' terminus. (Additional primers can be added to increase the number of target 

sequences analyzed.) 

The method can fiirther include analyzing the derivative nucleic acids of the 
method, e.g., by evaluating the capture tag sequence of an overhang (The derivative 
nucleic acids can also be analyzed by analysis of the target-specific regions of the 
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amplicons.) The sequence of the overhang(s) can provide information as to the sequence 
of the target nucleic acid(s). The analysis can be performed on a duplex molecule which 
has an overhang at one or both ends or on single strands. One or both single strands can 
be analyzed. In the case where both strands are analyzed it is preferable that the 
5 sequence tags on each strand be different from one another, allowing, e.g., independent 
analysis, e.g., on an array of capture probes. 

A preferred method of evaluating the derivative nucleic acid molecules having 
an overhang includes contacting the molecules with a capture probe, e.g., a capture probe 
bound to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the 
1 0 molecules can be hybridized to an array having a plurality of capture probes, wherein 

each of the capture probes is positionally distinguishable from the other capture probes of 
the plurality and has a unique variable region not repeated in another capture probe of tiie 
plurality. In a particularly preferred embodiment the derivative nucleic acid is ligated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
1 5 washed, to achieve highly specific capture. The unique regions hybridize with the 
capture tag sequence of a single strand molecule or the single strand overhang(s) of a 
double stranded molecule, thereby allowing evaluation, e.g., identification or 
quantitation, or fiirther analysis, e.g., sequence analysis, by a variety of analytical 
methods, of a target molecule. A capture probe can be single or double stranded but 
20 preferably has a double stranded region and a single stranded region preferably it has a 3' 
end capable of serving as a priming site for extension. Preferably the unique region is a 
3' or 5' overhang preferably in the 3' single strand overhang of the capture probe. The 
template dependent extension of the 3' end of the capture probe is diagnostic for capture 
of the derivative nucleic acid. However, there are cases (e.g., when the derivative nucleic 
25 acids is to be analyzed by primer extension using a special primer) that the 3 ' end should 
be blocked so that no extension can occur. 

In a preferred embodiment a partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety that allows for the 
30 immobiUzation and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
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which are bound by non-covalent means to fonn a structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety on one of the molecules that 
allows for the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes consists of a pair of 
molecules which are bound by covalent means to form a structure with either a 3' or 5' 
overhang and which contains a chemical moiety on one of the molecules that allows for 
the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel 

array. 

In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can 
increase the specificity of the evaluation. The enzyme mediated reaction can be: an 
extension reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a 
ligation, e.g., a reaction catalyzed by a Ugase; or a nucleic acid cleavage reaction, e.g., a 
cleavage catalyzed by a restriction enzyme, e.g., a Type lis enzyme. The derivative 
nucleic acid sequence which hybridizes with the capture probe can be the substrate in an 
enzyme mediated reaction, e.g., it can be ligated to a strand of the capture probe or it can 
be extended along a strand of the capture probe. Alternatively, the capture probe can be 
extended along the derivative nucleic acid sequence. Alternatively, a separate primer can 
be contacted with the derivative nucleic acid for the analysis. (Any of the extension 
reactors discussed herein can be performed with labeled, or chain terminating, subunits.) 
The capture probe duplex can be the substrate for a cleavage reaction. These reactions 
can be used to increase specificity of the method or to otherwise aid in detection, e.g., by 
providing a signal. A wash following a reaction, e.g., a ligase reaction, can improve 
detection and/or specificity. 

In a preferred embodiment the derivative nucleic acid is allowed to hybridize 
to the array and the 3' end of the captiire probe is extend across tiie region of the target 
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sequence, e.g., a genomic nucleic acid having a genetic event, with one or more 
tenninating base species, where if more than one is used each species has a unique 
distinguishable label e.g. label 1 for base A, label 2 for base T, label 3 for base G, and 
label 4 for base C; thereby analyzing the amplified sample sequence. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

Methods of the invention can be used with a wide variety of samples and targets. 
In preferred embodiments the target is: a DNA molecule: all or part of a known gene; 
wild type DNA; mutant DNA; a genomic fi-agment, particularly a human genomic 
fragment; a cDNA, particularly a human cDNA. In preferred embodiments the target a 
cDNA the synthesis of which was directed by: an RNA molecule: an RNA transcript; 
wild type RNA; mutant RNA, or a human RNA. In preferred embodiments the target 
sequence is: a human sequence; a non-human sequence, e.g., a mouse, rat, pig, primate. 
In preferred embodiments the method is performed: on a sample fi-om a human subject; 
and a sample firom a prenatal subject; as part of genetic counseling; to determine if the 
individual fi-om which the target nucleic acid is taken should receive a drug or other 
treatment; to diagnose an individual for a disorder or for predisposition to a disorder; to 
stage a disease or disorder; to determine if a party should pay for a treatment, e.g., to 
determine if one party, e.g., an insurance company or government, should pay or 
reimbvirse a party which has paid for a treatment. 

Methods of U.S. 5,503,980 and or U.S. 5,631,134, both of which are hereby 
incorporated by reference can be used herein, particularly, the array and array-related 
steps recited herein can use methods taught in these patents. 

In another aspect, the invention features, a method of using Hgatable probes to 
provide a nucleic acid having single-strand overhangs suitable for analysis preferably a 
double strand nucleic acid with single strand overhangs at one or both ends, suitable for 
multiplex analysis e.g., suitable e.g., for detecting one or more genetic events, e.g., one or 
more single nucleotide polymorphisms, in a sample. 
The method includes: 
providing a first and second probe, 

wherein said first probe includes, preferably in the order of 5' to 3', 
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a first region which includes a universal primer sequence (referred 
to occasionally herein as Uf); 

a second region which includes a capture tag sequence and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme (referred to 
occasionally herein as R/Tl), and 

a third region which can which can hybridize to a first region on 
the target nucleic acid (referred to occasionally herein as X5'), 
wherein said second probe/primer includes, preferably in the order of 3' to 

5\ 

a first region which can which can hybridize to a second region on 
the target nucleic acid (the first and second region of the target sequence 
can abut, or can be separated e.g., by 1 or up to 2, 3, 4, 5, 10 or 20 
nucleotides, in the case where the regions, the first and second probe can 
be joined by ligation, in the case where they are separated by one or more 
nucleotides the first and second primer can be joined by polymerase 
directed synthesis and ligation) (referred to occasionally herein as X3'), 

a second region which includes a capture tag sequence (the capture 
tag sequence on the second probe can be the same or different, and is 
preferably of different sequence, firom that of the capture tag of the first 
probe) and a cleavage site, e.g., a site for cleavage by a restriction enzyme 
(referred to occasionally herein as R/T2), and 

a third region which includes a universal primer sequence (referred 
to occasionally herein as Ur); 
forming a reaction mixture which includes the first and second probes and the 
target nucleic, under conditions wherein the first and second probes are joined, e.g., by 
ligation, if the target is of a first sequence and not joined if the target is of a second 
sequence, to produce a joined probe, which preferably includes, in order, a universal 
primer sequence, a capture tag sequence, target sequence, a capture tag sequence, and a 
universal primer sequence, preferably in the order 5' to 3'; 

optionally, contacting a joined probe with a pair of universal primers which bind 
to the universal primer sequences (and preferably not to regions on the joined probe other 
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than the universal primer sequences) and extending the universal primer sequences along 
the joined probe strand (or its complement) to produce one or a plurality of double 
stranded molecules having, in order, a universal primer sequence, a capture tag sequence, 
target sequence, a capture tag sequence, and a universal primer sequence; 

cleaving at the cleavage site of one or both ends of a double stranded extended 
target molecule to provide a derivative nucleic acid which is a double stranded molecule 
having overhangs which include the capture tag sequence at one or both of the 3' and 5' 
termini, preferably at the 3' terminus. 

thereby producing a target molecule having overhangs. 

Methods of the invention allow for multiplexed reactions, e.g., reactions in which 
two or more targets, and preferably as many as 10, 50, 100, 200, 500, 1000, or 5000 
target sequences, are analyzed. Targets can be on the same molecule or can be on 
different molecules. In the examples below an embodiment wherein the targets are on 
different molecules is described, but the method can be used to analyze different regions 
of a single molecule. Thus, in a preferred embodiment the reaction mix includes a 
second target, and a third and a forth probe are included in the reaction mix, 

wherein said third probe includes, preferably in the order of 5' to 3', 
a first region which includes a universal primer sequence; 
a second region which includes a capture tag sequence (which is 
preferably different form the capture tag sequence on one or both of the 
first and second probe) and a cleavage site, e.g., a site for cleavage by a 
restriction enzyme, and 

a third region which can which can hybridize to a first region on 
the target nucleic acid, 

wherein said fourth probe includes, preferably in the order of 5' to 3 \ 

a first region which can which can hybridize to a second region on 
the target nucleic acid (the first and second region of the target sequence 

can abut, or can be separated by 1 or ^nucleotides, in the case where 

the regions about, the third and fourth probe can be joined by ligation, in 
the case where they are separated by one or more nucleotides the third and 
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fourth probe/primer can be joined by polymerase directed synthesis and 
ligation), 

a second region which includes a capture tag sequence (which is 
preferably different form the capture tag sequence on one or both of the 
first and second primer, and which can be the same or different, and is 
preferably of different sequence, fi-om that of the capture tag of the third 
primer) and a cleavage site, e.g., a site for cleavage by a restriction 
enzyme, and 

a third region which includes a universal primer sequence; 

forming a reaction mixture which includes the first, second, third and fourth probe 
and the two targets xmder conditions wherein the third and fourth probe are joined, e.g., 
by Ugation, if the second target is of a first sequence and not joined if the second target is 
of a second sequence, to produce a second joined probe/primer, which preferably 
includes, in order, a universal primer sequence, a capture tag sequence, target sequence, a 
capture tag sequence, and a universal primer sequence, preferably in the order 5' to 3'; 

optionally, contacting a second joined probe with a pair of universal primers 
which bind to the universal primer sequences (and preferably not to regions on the second 
joined probe other than the universal primer sequences) and extending the universal 
primers along the second joined probe strand (or its complement) to produce one or a 
plurality of double stranded molecules having, in order, a universal primer sequence, a 
capture tag sequence, second target sequence, a capture tag sequence, and a universal 
primer sequence; 

cleaving at the cleavage site of one or both ends of a double strand joined probe to 
provide a derivative nucleic acid which is a double stranded molecule having overhangs 
which include the capture tag sequence at one or both of the 3' and 5' termini, preferably 
at the 3' terminus, 

The method can further include analyzing the derivative nucleic acids of the 
method, e.g., by evaluating the capture tag sequence of an overhang (The derivative 
nucleic acids can also be analyzed by analysis of the target-specific regions of the 
amplicons). The sequence of the overhang(s) can provide information as to the sequence 
of the target nucleic acid(s). The analysis can be performed on a duplex molecule which 
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has an overhang at one or both ends or on single strands. One or both single strands can 
be analyzed. In the case where both strands are analyzed it is preferable that the 
sequence tags on each strand be different from one another, allowing, e.g., independent 
analysis, e.g., on an array of capture probes. 

A preferred method of evaluating the derivative nucleic acid molecules having an 
overhang includes contacting the molecules with a capture probe, e.g., a capture probe 
bound to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the 
molecules can be hybridized to an array having a plurality of capture probes, wherein 
each of the capture probes is positionally distinguishable from the other capture probes of 
the pluraUty and has a unique variable region not repeated in another capture probe of the 
plurality. In a particularly preferred embodiment the derivative nucleic acid is ligated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
washed, to achieve highly specific capture. The unique regions hybridize with the 
capture tag sequence of a single sfrand molecule or the single sfrand overhang(s) of a 
double sfranded molecule, thereby allowing evaluation, e.g., identification or 
quantitation, or fiirther analysis, e.g., sequence analysis, by a variety of analytical 
methods, of a target molecule. A capture probe can be single or double sfranded but 
preferably has a double stranded region and a single sfranded region. Preferably it has a 
3' end capable of serving as a priming site for extension. Preferably the unique region is 
in a 3' or 5' overhang preferably the 3' single sfrand overhang of the capture probe. The 
template dependent extension of the 3' end of the capture probe is diagnostic for capture 
of the derivative nucleic acid. However, there are cases (e.g., when the derivative nucleic 
acids is to be analyzed by primer extension using a special primer) that the 3' end should 
be blocked so that no extension can occur. 

In a preferred embodiment a partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety on one of the molecules that 
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allows for the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes consists of a pair of 
molecules which are bound by covalent means to form a structure with either a 3' or 5' 
overhang and which contains a chemical moiety on one of the molecules that allows for 
5 the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 
10 In preferred embodiments the method includes one or more enzyme mediated 

reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can 
increase the specificity of the evaluation. The enzyme mediated reaction can be: an 
extension reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a 
1 5 ligation, e.g., a reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a 
cleavage catalyzed by a restriction enzyme, e.g., a Type lis enzyme. The derivative 
nucleic acid sequence which hybridizes with the capture probe can be the substrate in an 
enzyme mediated reaction, e.g., it can be Ugated to a strand of the capture probe or it can 
be extended along a strand of the capture probe. Alternatively, the capture probe can be 
20 extended along the derivative nucleic acid sequence. Alternatively, a separate primer can 
be contacted with the derivative nucleic acid for the analysis. (Any of the extension 
reactors discussed herein can be performed with labeled, or chain terminating, subunits.) 
The capture probe duplex can be the substrate for a cleavage reaction. These reactions 
can be used to increase specificity of the method or to otherwise aid in detection, e.g., by 
25 providing a signal. A wash following a reaction, e.g., a ligase reaction, can improve 
detection and/or specificity. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

Methods of the invention can be used with a wide variety of samples and targets. 
30 In preferred embodiments the target is: a DNA molecule: all or part of a known gene; 
wild type DNA; mutant DNA; a genomic fragment, particularly a human genomic 
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fragment; a cDNA, particularly a human cDNA. In preferred embodiments the target a 
cDNA the synthesis of which was directed by: an RNA molecule: an RNA transcript; 
wild type RNA; mutant RNA, or a human RNA. In preferred embodiments the target 
sequence is: a human sequence; a non-human sequence, e.g., a mouse, rat, pig, primate. 
In preferred embodiments the method is performed: on a sample from a human subject; 
and a sample from a prenatal subject; as part of genetic counseling; to determine if the 
individual from which the target nucleic acid is taken should receive a drug or other 
freatment; to diagnose an individual for a disorder or for predisposition to a disorder; to 
stage a disease or disorder; to determine if a party should pay for a treatment, e.g., to 
determine if one party, e.g., an insurance company or government, should pay or 
reimburse a party which has paid for a treatment. 

In one aspect, the invention features a method of providing a nucleic acid with 
single sfrand overhang, preferably a double strand nucleic acid with single strand 
overhangs at one or both ends, suitable for multiplex analysis, e.g., suitable for detecting 
one or more genetic events, e.g., one or more single nucleotide polymorphisms, in a 
sample. 

The method includes: 

(1) providing a sample which includes one or a plurality of target nucleic acid 

sequences; 

(2) providing a first single stranded linear probe, wherein the first single- 
stranded linear probe includes, 

at one terminus, a first region which is complementary to a first region on a 
first target, and at its other terminus a second region which is complementary to a second 
region on a first target, wherein the first and second region on the first target can be 
directly or can be separated by one or more nucleotides (upon hybridization of the first 
and second regions to the target, the termini of the probe can be joined, e.g., by ligation, 
or by polymerase catalyzed extension and ligation, thus the probe will be circularized 
only when the target sequence for which the probe is specific is present in the sample), 

a cleavage site 

a capture tag sequence, wherein the cleavage site and capture tag sequence are 
disposed such that cleavage results in a single-stranded overhang, at one or both of the 3' 
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and 5' termini, preferably a 3' single stranded overhang, which includes at its terminus, 
the capture sequence tag, and 

a universal primer sequence; 

(3) providing a second single stranded Unear probe, wherein the second 
single-stranded probe includes, 

at one terminus, a first region which is complementary to a first region on a 
second target, and at its other terminus a second region which is complementary to a 
second region on a second target, wherein the first and second region on the first target 
can be directly or can be separated by one or more nucleotides, 

a cleavage site 

a capture tag sequence, which capture sequence preferably differs in sequence 
fi-om the capture tag on the first single stranded probe, and wherein the cleavage site and 
capture tag sequence are disposed such that cleavage results in a single-stranded 
overhang, at one or both of its 3' and 5' termini, preferably a 3' single stranded overhang, 
which includes at its terminus, the capture sequence tag, and 

a universal primer sequence, which is preferably, of the same sequence as the 
universal primer sequence on the first single stranded probe; 

contacting the first single stranded probe with the first target and the second 
single stranded probe with the second target under conditions which allow the 
circularization of the a single stranded circular probe if to be circularized if a target is 
present which is homologous to its terminal target binding regions; 

contacting the first and second probes with a universal primer under 
conditions which allow rolling circle ampUfication and produce double stranded 
amplification product; 

cleaving the double stranded ampUfication product at cleavage sites, e.g., with 
a restriction enzyme, e.g., a type II restiiction enzyme, to provide cleaved product having 
a capUxre tag sequence at the terminus of a single sft-anded overhang. 

The method can fiirther include analyzing the derivative nucleic acids of the 
method, e.g., by evaluating the capture tag sequence of an overhang (The derivative 
nucleic acids can also be analyzed by analysis of the target-specific regions of the 
amplicons). The sequence of the overhang(s) can provide information as to the sequence 
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of the target nucleic acid(s). The analysis can be performed on a duplex molecule which 
has an overhang at one or both ends or on single strands. One or both single strands can 
be analyzed. In the case where both strands are analyzed it is preferable that the 
sequence tags on each strand be different from one another, allowing, e.g., independent 
5 analysis, e.g., on an array of capture probes. 

A preferred method of evaluating the derivative nucleic acid molecules having an 
overhang includes contacting the molecules with a capture probe, e.g., a capture probe 
bound to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the 
molecules can be hybridized to an array having a plurality of capture probes, wherein 
1 0 each of the capture probes is positionally distinguishable from the other capture probes of 
the plurality and has a unique variable region not repeated in another capture probe of the 
pluraUty. In a particularly preferred embodiment the derivative nucleic acid is Ugated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
washed, to achieve highly specific capture. The unique regions hybridize with the 
1 5 capture tag sequence of a single strand molecule or the single strand overhang(s) of a 
double stranded molecule, thereby allowing evaluation, e.g., identification or 
quantitation, or further analysis, e.g., sequence analysis, by a variety of analytical 
methods, of a target molecule. A capture probe can be single or double stranded but 
preferably has a double stranded region and a single stranded region. Preferably it has a 
20 3' end capable of serving as a priming site for extension. Preferably the unique region is 
in a 3' or 5' overhang, preferably the 3' single strand overhang of the capture probe. The 
template dependent extension of the 3' end of the capture probe is diagnostic for capture 
of the derivative nucleic acid. However, there are cases (e.g., when the derivative nucleic 
acids is to be analyzed by primer extension using a special primer) that the 3' end should 
25 be blocked so that no extension can occur. 

In a preferred embodiment the partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety that allows for the 
immobilization and spatial separation of the probe molecules. 
30 In a preferred embodiment the partially duplex probes include a pair of molecules 

which are bound by non-covalent means to form a structure with either a 3 ' or 5 ' 
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overhang and optionally which contains a chemical moiety on one of the molecules that 
allows for the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the partially duplex probes consists of a pair of 
molecules which are bound by covalent means to form a structure with either a 3' or 5' 
5 overhang and which contains a chemical moiety on one of the molecules that allows for 
the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
10 e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
9 nucleic acid, is the substrate or template for the enzyme mediated reaction. This can 

m increase the specificity of the evaluation. The enzyme mediated reaction can be: an 

1 5 extension reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a 
^ ligation, e.g., a reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a 

\4 cleavage catalyzed by a restriction enzyme, e.g., a Type lis enzyme. The derivative 

% nucleic acid sequence which hybridizes with the capture probe can be the substrate in an 

y enzyme mediated reaction, e.g., it can be ligated to a strand of the capture probe or it can 

> 20 be extended along a strand of the capture probe. Alternatively, the capture probe can be 
? extended alone the derivative nucleic acid sequence. Alternatively, a separate primer can 

be contacted with the derivative nucleic acid for the analysis. (Any of the extension 
reactors discussed herein can be performed with labeled, or chain terminating, subunits.) 
The capture probe duplex can be the substrate for a cleavage reaction. These reactions 
25 can be used to increase specificity of the method or to otherwise aid in detection, e.g., by 
providing a signal. A wash following a reaction, e.g., a ligase reaction, can improve 
detection and/or specificity. 

In a preferred embodiment the derivative nucleic acid is allowed to hybridize to 
the array and the 3' end of the capture probe is extend across the region of the target 
30 sequence, e.g., a genomic nucleic acid having a genetic event, with one or more 
terminating base species, where if more than one is used each species has a unique 
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distinguishable label e.g. label 1 for base A, label 2 for base T, label 3 for base G, and 
label 4 for base C; thereby analyzing the amplified sample sequence. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 
5 In preferred embodiments, a genetic event is within a region which hybridizes to a 

probe, e.g., is 1, 2, 3, 4 or 5 base pairs from the end of a region which hybridizes to a 
probe, or is at or sufficiently close to the end of the a region which hybridizes to a probe 
that a mismatch would inhibit Ugation or DNA polymerase-based extension. 

Methods of the invention can be used with a wide variety of samples and targets. 
1 0 In preferred embodiments the target is: a DNA molecule: all or part of a known gene; 
wild type DNA; mutant DNA; a genomic fragment, particularly a human genomic 
fragment; a cDNA, particularly a human cDNA. In preferred embodiments the target a 
cDNA the synthesis of which was directed by: an RNA molecule: an RNA transcript; 
5 wild type RNA; mutant RNA, or a human RNA. In preferred embodiments the target 

m 1 5 sequence is: a human sequence; a non-human sequence, e.g., a mouse, rat, pig, pnmate. 
^ In preferred embodiments the method is performed: on a sample from a human subject; 

5 and a sample from a prenatal subject; as part of genetic counseling; to determine if the 

individual from which the target nucleic acid is taken should receive a drug or other 
treatment; to diagnose an individual for a disorder or for predisposition to a disorder; to 
20 stage a disease or disorder; to determine if a party should pay for a treatment, e.g., to 
determine if one party, e.g., an insurance company or government, should pay or 
reimburse a party which has paid for a treatment. 

In a preferred embodiment the single stranded linearized circular probe will have 
its termini joined only when there is perfect complementarity between the regions of the 
25 probe which hybridize to the target and the target. 

In a preferred embodiment more than two, e.g., and as many as 10, 50, 100, 200, 
500, 1000, or 5000 target sequences are present and a single strand linearized circular 
probe specific for each is contacted with the sample. 

In some embodiments a single nucleic acid molecule will include more than one 
30 target sequence, in others, target sequences are on separate molecules, e.g., separate 
restriction or shear fragments. 
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In a preferred embodiment the method is used to detect a genetic event, e.g., a 
mutation or an SNP, in a target sequence. The nucleotide at complementary to the 
genetic event or to a nucleotide of a genetic event, can be degenerate, e.g., the method 
can include the use of a plurality of primers which differ from each other at the 
interrogation site, e.g., four primers, one each with a, g, c, and t, at the interrogation site. 
Each of the plurality of primers will have a different capture tag, thus the method allows 
identification or sequencing of the nucleotide of interest. 

(b) combining the circular template with an effective amount of a RCA 
primer, at least two types of nucleotide triphosphates and an effective amount of a 
polymerase enzyme to yield a product, e.g., a single-stranded oligonucleotide multimer 
complementary to the circular oligonucleotide template; and 
In preferred embodiments ampUfication reactions are performed isothermally. 

In preferred embodiments, analyzing a sample polynucleotide sequence includes, 
e.g., sequencing the polynucleotide sequence, e.g., by sequencing by hybridization or 
positional sequencing by hybridization, detecting the presence of, or identifying, a 
genetic event, e.g., a SNP, in a target nucleic acid, e.g., a DNA. 

In preferred embodiments, the genetic event is within 1, 2, 3, 4 or 5 base pairs 
from the end of the target region which hybridizes to the probe, or is sufficiently close to 
the end of the target region which hybridizes to the probe that a mismatch would inhibit 
DNA polymerase-based extension from a target/ primed circle. 

In preferred embodiments, the target nucleic acid is amplified, e.g., by PGR, prior 
to contact with a circular template. 

Preferably, a circular template has about 15-1500 nucleotides, and more 
preferably about 24-500 nucleotides and most preferably about 30-150 nucleotides. 

The polymerase enzyme can be any that effects the synthesis of the multimer, 
e.g., any polymerase described in 5, 714, 320. Generally, the definitions provided for 
circular vectors and their amplification in 5, 714,320, apply to terms used herein, unless 
there is a conflict between the terms in which case the meaning provided herein controls. 
5, 714, 320, and all other U.S. patents mentioned herein are incorporated by reference. 

In another aspect, the invention features a method of providing a nucleic acid with 
single strand overhang, preferably a double strand nucleic acid with single strand 
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overhangs at one or both ends, suitable for multiplex analysis e.g., suitable for detecting 
one or more genetic events, e.g., one or more single nucleotide polymorphisms, in a 
sample. 

The method includes: 

(a) providing a target nucleic acid having a first and second region, wherein 
the two regions preferably overlap; 

providing an invader probe which is complementary to the first region of the 

target, 

providing a signal probe having, in the 5' to 3' direction, (optionally) a signal 
sequence, a capture tag sequence, and a region complementary to the second region of the 
target nucleic acid; 

(b) contacting the target sequence with the invader probe and the signal probe, 
under conditions wherein the invader probe and an end, e.g., the 3' end of the signal 
probe are annealed to the target nucleic acid sequence so as to create a cleavage structure 
having a single-stranded arm which includes, optionally, the signal sequence, and the 
capture tag; 

(c) cleaving the first cleavage structure under conditions such that cleavage of 
the cleavage structure occurs at a site located within the signal probe in a manner 
dependent upon the amiealing of the invader and signal probes on the target nucleic acid 
such that cleavage liberates the single-stranded arm of the cleavage structure to generate 
a derivative nucleic acid which has the capture tag sequence at a terminus, preferably its 
3' terminus; 

(d) optionally allowing a subsequent, or a plurality of subsequent copies of the 
signal probe to anneal and be cleaved to produce an additional or a plurality of additional 
derivative nucleic acids which have the capture tag sequence at a terminus, preferably the 
3' terminus; 

thereby providing a derivative nucleic acid suitable for analysis. 

In a preferred embodiment, the signal probe can include a sequence 5' to the 
capture tag which self hybridizes to form a hairpin structure, such that cleavage results in 
a structure having a hair pin with a single strand overhang. The capture tag is on the 
single strand overhangs which is preferably a 3' overhang. 



G:\DATA\PLM\PKP\application.doc 



24 

Methods of the invention allow for multiplexed reactions, e.g., reactions in which 
two or more targets, and preferably as many as 10, 50, 100, 200, 500, 1000, or 5000, are 
analyzed. Targets can be on the same molecule or can be on different molecules. In the 
examples below an embodiment wherein the targets are on different molecules is 
5 described, but the method can be used to analyze different regions of a single molecule. 
Thus, in a preferred embodiment the reaction mix includes a second target having a first 
and second region, and a second invader and signal probe, wherein 

the second invader probe is complementary to a first region of a second target, 
the second signal probe includes, in the 5' to 3' direction,(optionally) a signal 
10 sequence, a capture tag sequence (which is preferably different in sequence fi-om the 
capture tag sequence on the first signal probe, and a region complementary to a second 
region of the second target nucleic acid; 

(b) contacting the second target sequence with the second invader probe and 
the second signal probe, under conditions wherein the second invader probe and an end, 

15 e.g., the 3' end of the second signal probe are annealed to the second target nucleic acid 
sequence so as to create a second cleavage structure having a single-stranded arm which 
includes, optionally, the signal sequences, and the capture tag; 

(c) cleaving the second cleavage structure under conditions such that cleavage 
of the second cleavage structure occurs at a site located within the second signal probe in 

20 a manner dependent upon the annealing of the second invader and signal probes on the 
target nucleic acid such that cleavage liberates the single-stranded arm of the cleavage 
structure to generate a second derivative nucleic acid which has the capture tag sequence 
at a terminus, preferably its 3' terminus; 

(d) optionally allowing a subsequent, or a plurality of subsequent copies of the 
25 second signal probe to anneal and be cleaved to produce an additional or a pluraUty of 

additional derivative nucleic acids which have the capture tag sequence at a terminus, 
preferably the 3' terminus. 

In a preferred embodiment, the signal probe can include a sequence 5* to the 
capture tag which self hybridizes to form a hairpin structure, such that cleavage results in 
30 a structure having a hair pin with a single strand overhang. The capture tag is on the 
single strand overhangs which is preferably a 3' overhang. 
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The method can further include analyzing the derivative nucleic acids of the 
method, e.g., by evaluating the capture tag sequence of an overhang (The derivative 
nucleic acids can also be analyzed by analysis of the target-specific regions of the 
amplicons). The sequence of the overhang(s) can provide information as to the sequence 
of the target nucleic acid(s). The analysis can be performed on a duplex molecule which 
has an overhang at one or both ends or on single strands. One or both single strands can 
be analyzed. In the case where both strands are analyzed it is preferable that the 
sequence tags on each strand be different from one another, allowing, e.g., independent 
analysis, e.g., on an array of capture probes. 

A preferred method of evaluating the derivative nucleic acid molecules having an 
overhang includes contacting the molecules with a capture probe, e.g., a capture probe 
bound to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the 
molecules can be hybridized to an array having a plurality of capture probes, wherein 
each of the capture probes is positionally distinguishable from the other capture probes of 
the plurality and has a unique variable region not repeated in another capture probe of the 
plurality. In a particularly preferred embodiment the derivative nucleic acid is Ugated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
washed, to achieve highly specific capture. The unique regions hybridize with the 
capture tag sequence of a single strand molecule or the single strand overhang(s) of a 
double sfranded molecule, thereby allowing evaluation, e.g., identification or 
quantitation, or further analysis, e.g., sequence analysis, by a variety of analytical 
methods, of a target molecule. A capture probe can be single or double stranded but 
preferably has a double stranded region and a single stranded region. Preferably it has a 
3' end capable of serving as a priming site for extension. Preferably the unique region is 
in a 3' or 5' overhang, preferably the 3' single strand overhang of the capture probe. The 
template dependent extension of the 3' end of the capture probe can be diagnostic for 
capture of the derivative nucleic acid. However, there are cases (e.g., when the derivative 
nucleic acids is to be analyzed by primer extension using a special primer) that the 3' end 
should be blocked so that no extension can occur. 

In a preferred embodiment a partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
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overhang and optionally which contains a chemical moiety that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety on one of the molecules that 
allows for the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes consists of a pair of 
molecules which are bound by covalent means to form a structure with either a 3' or 5' 
overhang and which contains a chemical moiety on one of the molecules that allows for 
the immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can 
increase the specificity of the evaluation. The enzyme mediated reaction can be: an 
extension reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a 
ligation, e.g., a reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a 
cleavage catalyzed by a restriction enzyme, e.g., a Type lis enzyme. The derivative 
nucleic acid sequence which hybridizes with the capture probe can be the substrate in an 
enzyme mediated reaction, e.g., it can be ligated to a strand of the capture probe or it can 
be extended along a strand of the capture probe. Altematively, the capture probe can be 
extended along the derivative nucleic acid sequence. Altematively, a separate primer can 
be contacted with the derivative nucleic acid for the analysis. (Any of the extension 
reactors discussed herein can be performed with labeled, or chain terminating, subunits.) 
The capture probe duplex can be the substrate for a cleavage reaction. These reactions 
can be used to increase specificity of the method or to otherwise aid in detection, e.g., by 
providing a signal. A wash following a reaction, e.g., a ligase reaction, can improve 
detection and/or specificity. 
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In a preferred embodiment the derivative nucleic acid is allowed to hybridize to 
the array and the 3* end of the capture probe is extend across the region of the target 
sequence, e.g., a genomic nucleic acid having a genetic event, with one or more 
terminating base species, where if more than one is used each species has a unique 
5 distinguishable label e.g. label 1 for base A, label 2 for base T, label 3 for base G, and 
label 4 for base C; thereby analyzing the ampUfied sample sequence. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

In preferred embodiments, a genetic event is within a region which hybridizes to a 
10 probe, e.g., is 1, 2, 3, 4 or 5 base pairs from the end of a region which hybridizes to a 
probe, or is at or sufficiently close to the end of the a region which hybridizes to a probe 
that a mismatch would inhibit ligation or DNA polymerase-based extension. 

Methods of the invention can be used with a wide variety of samples and targets. 
In preferred embodiments the target is: a DNA molecule: all or part of a known gene; 
1 5 wild type DNA; mutant DNA; a genomic fragment, particularly a human genomic 

fragment; a cDNA, particularly a human cDNA. In preferred embodiments the target a 
cDNA the synthesis of which was directed by: an RNA molecule: an RNA transcript; 
wild type RNA; mutant RNA, or a human RNA. In preferred embodiments the target 
sequence is: a human sequence; a non-human sequence, e.g., a mouse, rat, pig, primate. 
20 In preferred embodiments the method is performed: on a sample from a human subject; 
and a sample from a prenatal subject; as part of genetic counseling; to determine if the 
individual from which the target nucleic acid is taken should receive a drug or other 
treatment; to diagnose an individual for a disorder or for predisposition to a disorder; to 
stage a disease or disorder; to determine if a party should pay for a treatment, e.g., to 
25 determine if one party, e.g., an insurance company or government, should pay or 
reimburse a party which has paid for a treatment. 

The cleavage structure can be cleaved with, e.g., a cleavage agent, such as those 
described in U.S. Patent No. 5,871,91 1. By "cleavage agent" is meant a molecule such as 
a DNA polymerase (DNAP), a domain of a DNAP, or a synthetically created protein or 
30 peptide, capable of cleaving a cleavage structure at a specific site. These enzymes are 
sometimes referred to as "cleavases" or as "flap endonucleases". This activity is 
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sometimes associated with polymerases, but the activity of course is always an 
endonuclease. A preferable example of a cleavage agent is a 5' nuclease activity of 
DNAP, such as DNAPTaq, DNAPTfl, DNAPTth and DNAPEcl. After exposing the 
cleavage structure to a cleavage agent, the structure and agent are incubated under 
conditions wherein cleavage can occur. 

The derivative nucleic acid has a capture tag sequence, e.g., a sequence which is 
complementary to capture array probes. The sequences complementary to capture probes 
can be positioned within the second oligonucleotide such that they are at the 3' end of the 
third oligonucleotide upon cleavage of the second oligonucleotide. Thus, products from a 
multiplexed assay can be captured at different positions on an array of capture probes. 

In some embodiments, the sequence at the 5' end of the derivative nucleic acid 
includes an arbitrary sequence and can includes a label, e.g., a primary label, such as 
biotin. When biotin us used, a streptavidin labeled enzyme can be attached to detect the 
captured third oligonucleotide. 

In some embodiments, the 5' end of the derivative nucleic acid includes a 
sequence tag, which can encode information, e.g., a specific allele for a polymorphic 
DNA sequence. The information can be detected using different reporter moieties, e.g., 
different colors of fluorophors. hi some embodiments, the third oligonucleotide is 
detected using enzyme-based chemiluminescence or fluorogenic amplification. 

In some embodiments, the method is used to analyze a genetic even, e.g., a SNP 
or a mutation. The genetic event can occur in the first region, the second region, or at the 
junction of the two regions. 

In some embodiments, a biallelic or multi-alleUc SNP is analyzed. 
In some embodiments, the third oUgonucleotides from a plurality of sites are 
analyzed. 

In some embodiments, the derivative nucleic acid is fiirther amplified, e.g., by an 
isothermal method, prior to fiirther analysis, e.g., prior to hybridization to the capture 
probe array. The ampUfication can include, e.g., rolling circle ampUfication (RCA). 
When RCA is used, the method can include: 

(a) aimealing an effective amount of the derivative nucleic acid to a single- 
stranded circular template to yield an annealed circular template, wherein the single- 
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stranded circular template comprises (i) at least one copy of a nucleotide sequence 
complementary to the sequence of the derivative nucleic acid and optionally, (ii) at least 
one nucleotide effective to produce a cleavage site in an oligonucleotide multimer; 

(b) providing the primed circular template with effective amounts of a primer, at 
least two types of nucleotide triphosphates and a polymerase enzyme, to yield a single- 
stranded oligonucleotide multimer complementary to the circular oligonucleotide 
template, wherein the oligonucleotide multimer comprises multiple copies (amplified) of 
the sample sequence; optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the 
cleaved amplified sample nucleic acid; and 

(d) hybridizing the cleaved sample nucleic acid to the array of capture probes. 
In some embodiments, the method is used to detect single nucleotide 

polymorphisms (SNPs). Two different probes can be encoded with different tag 
sequences. Multiple pairs of signal probes can be used simultaneously with the same 
sample to multiplex the analysis of many target sequences. The products of the reaction 
can then be analyzed on a capture probe array, e.g., a PSBH array. 

Methods of the invention allow for the capture of cleavage products from cleavase 
invader assays. Accordingly, in one aspect, the invention includes a method of analyzing 
a polynucleotide sequence, e.g., a specific target nucleic acid molecule. The method 
includes: 

(a) providing (i) a target nucleic acid having a first and second portion; (ii) a first 
oligonucleotide complementary to the first portion of the target nucleic acid; and (ii) a 
second oligonucleotide having a 5' and 3' end and a region complementary to the second 
portion of the target nucleic acid, as well as a region which is not complementary to the 
target nucleic acid; 

(b) mixing the target nucleic acid, first oligonucleotide, and second nucleotide 
under conditions in which the first oUgonucleotide and an end, e.g., the 3' end of the 
second oligonucleotide are annealed to the target nucleic acid sequence so as to create a 
cleavage structure having a single-stranded arm; 

(c) cleaving the first cleavage structure under conditions such that cleavage of the 
cleavage structure occurs at a site located within the second oUgonucleotide in a manner 
dependent upon the annealing of the first and second oligonucleotides on the target 
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nucleic acid such that cleavage liberates the single-stranded arm of the cleavage structure 
to generate a third oUgonucleotide; 

(d) providing an array of a plurality of capture probes, wherein each of the capture 
probes is positionally distinguishable from other capture probes of the pluraUty on the 

5 array and wherein each of the capture probes contains a region of unique sequence; and 

(e) hybridizing the third ohgonucleotide with the array of capture probes, thereby 
analyzing the sample sequence. 

The cleavage structure can be cleaved with, e.g., a cleavage agent, such as those 
described in U.S. Patent No. 5,871,911. By "cleavage agent" is meant a molecule such as 

1 0 a DNA polymerase (DNAP), a domain of a DNAP, or a synthetically created protein or 
peptide, capable of cleaving a cleavage structure at a specific site. A preferable example 
of a cleavage agent is a 5' nuclease activity of DNAP, such as DNAPTaq, DNAPTfl, 
DNAPTth and DNAPEcl. After exposing the cleavage structure to a cleavage agent, the 
structure and agent are incubated under conditions wherein cleavage can occur. 

1 5 In some embodiments, the third oUgonucleotide has an arbitrary sequence, e.g., a 

sequence which includes an arbitrary tag sequence complementary to capture array 
probes. The sequences complementary to capture probes can be positioned within the 
second oUgonucleotide such that they are at the 3' end of the third oUgonucleotide upon 
cleavage of the second oligonucleotide. Thus, products from a multiplexed assay can be 

20 captured at different positions on an array of capture probes. 

In some embodiments, the sequence at the 5' end of the third oligonucleotide 
includes an arbitrary sequence and includes a label, e.g., a primary label, such as biotin. 
When biotin us used, a streptavidin labeled enzyme can be attached to detect the captured 
third oUgonucleotide. 

25 In some embodiments, the 5' end of the third oUgonucleotide includes a sequence 

tag, which can encode information, e.g., a specific allele for a polymorphic DNA 
sequence. The information can be detected using different reporter moieties, e.g., 
different colors of fluorophors. In some embodiments, the third oligonucleotide is 
detected using enzyme-based chemiluminescence or fluorogenic ampUfication. 
30 In some embodiments, the method is used to analyze a genetic even, e.g., a SNP 

or a mutation. The genetic event can occur in the first region, the second region, or at the 
junction of the two regions. 

In some embodiments, a biallelic or multi-aUelic SNP is analyzed. 
In some embodiments, the third oUgonucleotides from a plurality of sites are 
35 analyzed. 
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In some embodiments, the third oligonucleotide is further amplified, e.g., by an 
isothermal method, prior to hybridization to the capture probe array. The amplification 
can include, e.g., rolling circle amplification (RCA). When RCA is used, the method can 
include: 

(a) annealing an effective amount of the third ohgonucleotide to a single-stranded 
circular template to yield an annealed circular template, wherein the single-stranded 
circular template comprises (i) at least one copy of a nucleotide sequence complementary 
to the sequence of the third oligonucleotide and optionally, (ii) at least one nucleotide 
effective to produce a cleavage site in an oligonucleotide multimer; 

{b) providing the primed circular template with effective amounts of a primer, at 
least two types of nucleotide triphosphates and a polymerase enzyme, to yield a single- 
stranded oligonucleotide multimer complementary to the circular oligonucleotide 
template, wherein the oligonucleotide multimer comprises multiple copies (amplified) of 
the sample sequence; optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the 
cleaved amplified sample nucleic acid; and 

(d) hybridizing the cleaved sample nucleic acid to the array of capture probes. 
In some embodiments, the method is used to detect single nucleotide 

polymorphisms (SNPs). Two different probes can be encoded with different tag 
sequences. Multiple pairs of signal probes can be used simultaneously with the same 
sample to multiplex the analysis of many target sequences. The products of the reaction 
can then be analyzed on a capture probe array, e.g., a PSBH array. 

Methods of the invention also allow for capture of single stranded products form 
multiplexed PCR. Accordingly, in another aspect, the invention includes a method of 
analyzing a polynucleotide sequence. The method includes: 

providing a double-stranded nucleic acid sequence, wherein the 5* ends of one or 
both sequences have a region of unique sequence; 

(optionally) contacting the double-stranded nucleic acid sequence with an agent 
which promotes strand separating, e.g., a peptide-nucleic acid (PNA) having a sequence, 
e.g.,. a genetic sequence, complementary to a region near the end or ends of the double- 
stranded nucleic acid molecule, thereby exposing a region of single-stranded nucleic acid 
sequence containing the unique sequence; and 

hybridizing the nucleic acid sequence having an exposed single-stranded region to 
a capture array, thereby analyzing the nucleic acid sequence. 

In some embodiments, the hybridized nucleic acid sequence is ligated to the 
capture array. Preferably, ligation is followed by a washing step. 
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In some embodiments, the method includes providing a pair of primers, e.g., PGR 
primers, having generic tags and peptide nucleic acid (PNA)-binding sites and using the 
primers to amplify the nucleic acid sequence. PNA are discussed in detail in Kuhn et al. 
J. Mol. Evol. 286:1337-1345 (1999) and Bukanov et al., Proc. Nat. Acad. Sci. (USA) 
95:5516-20(1998). 

Methods of the invention can be used to capture unampUfied genomic targets and 
analyze them in an array. Accordingly, in another aspect, the invention features a method 
of detecting a nucleic acid sequence. The method includes: 

providing a population of double-stranded molecules having a defined sequence 
adjacent to at least one of their ends, wherein the defined sequences have a first and 
second defined regions, separated by a unique sequence region; 

(optionally) opening the duplex at one end, e.g., by contacting the double- 
stranded molecules with a first and second peptide nucleic acid complementary to the 
first and second defined regions under conditions sufficient to cause the unique sequence 
region to become single-stranded; and 

hybridizing the double-stranded molecules having the single-stranded unique 
sequence region to a capture probe array, thereby analyzing the polynucleotide sequence. 

In some embodiments, the double-stranded nucleic acid sequences are generated 
by digestion with a restriction enzyme. 

In some embodiments, the single-stranded unique region is annealed with a 
primer having an arbitrary sequence tags at its 5' end. The annealed primer is then 
extended with a DNA polymerase and other reagents known in the art. If desired, the 
extended strand is fiirther amphfied using PGR or the RGA amplification methods 
described herein. 

In some embodiments, the products of the primer extension or ampUfication are 
ligated to the capture probe arrays after hybridization. Preferably, the fragments are 
washed after ligation. The Ugated capture probes can be fiirther analyzed, e.g., by in situ 
RGA. 

In some embodiments, the capture probes have 5* overhangs. 

In some embodiments, the single-strand unique sequence region is hybridized 
with an oligonucleotide. 

In another aspect, the invention includes a method of thereby analyzing a nucleic 
acid sequence. The method includes: 

providing double-stranded fragments, e.g., genomic fragments, having at least one 
defined end, produced, e.g., by restriction digestion; 
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denatxiring an end of a fragment, e.g., by contact with a peptide nucleic acid to 
produce an open end; 

hybridizing a primer to the open end and extending it; 

hybridizing the extension produces to a capture probe array, thereby analyzing a 
5 nucleic acid sequence. 

In some embodiments, the primer has a generic sequence tag, e.g., at its 5* end, 
which can be hybridized to a capture probe, e.g., a PSBH array. 

In some embodiments, the nucleic acid sample is unamplified genomic DNA. 
In some embodiments, the products of the primer extension or amplification are 
10 ligated to the capture probe arrays after hybridization. Preferably, the fragments are 

washed after ligation. The ligated capture probes can be fiirther analyzed, e.g., by in situ 
RCA. 

In some embodiments, the capture probes have 5' overhangs. 
^ In some embodiments, the single-strand unique sequence region is hybridized 

5 15 with an oligonucleotide. 

3^ The invention also features bifimctional oligonucleotide probes and methods of 

J their use. Accordingly, in another aspect, the invention includes a method for analyzing a 

sJ nucleic acid sequence. The method includes: 

® providing a population of single-stranded nucleic acid molecules, e.g., denatured 

= 20 genomic DNA; 

contacting the population of nucleic acid molecules with an oligonucleotide to 
form a partially duplex region on the single-stranded nucleic acid molecules, wherein the 
oligonucleotide has a first region complementary to a defined nucleic acid sequence and a 
second region containing a recognition sequence for a type IIS restriction enzyme, e.g., 
25 Fokl; 

cleaving the partially duplex region with the type IIS restriction enzyme, wherein 
the restriction enzyme cleaves in the first region of the oligonucleotide to form a 
oligonucleotide digestion product; and 

hybridizing the oligonucleotide digestion product to a capture probe array, 
30 thereby analyzing the nucleic acid sequence. 

In some embodiments, the hybridized product is ligated to a capture probe array. 
The invention also features hybrid generic/custom dual probe arrays and methods 
of their use. Accordingly, in another aspect, the invention features a method for 
analyzing a nucleic acid sequence. The method includes: 
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providing an array comprising a plurality of capture probes having a single- 
stranded region and a double-stranded region, wherein preferably the plurality of capture 
probes includes a generic capture probe and a custom capture probe. 

hybridizing a sample nucleic acid to the array, wherein the hybridizing sample 
nucleic acid binds to a single-stranded region in a capture probe in the array so that the 5' 
terminus of the sample nucleic acid abuts the 3* terminus of the double-stranded region of 

the capture probe; and 

ligating the sample nucleic acid to the 3* terminus of the double-stranded region of 

the capture probe. 

In some embodiments, the sample nucleic acid is labeled, e.g., with biotin, a 
fluorophore, or one or more radiolabeled nucleotides. 

In some embodiments, the 3* terminus of the sample nucleic acid is extended 
using a DNA polymerase and sequences in the single stranded region of the capture probe 
as the template. 

In some embodiments, the extended product is labeled, e.g., with biotin, a 
fluorophore, or one or more radiolabeled nucleotides. 

The invention also features creation of custom arrays by hybridization of probe 
pools. Accordingly, in another aspect, the invention includes a method for forming a 
custom array. The method includes: 

providing an array of a plurality of foundation probes, wherein each of the 
foundation probes is positionally distinguishable from other foundation probes of the 
plurality on the array and wherein each of the foundation probes contains a region of 
unique sequence; 

contacting the foundation probe array with a plurality of custom oligonucleotides 
comprising a first and second sequence, wherein the first sequence is complementary to a 
defined sequence in a foundation probe in the foundation probe array and the second 
sequence is complementary to a target sequence, thereby forming a custom probe array. 

In some embodiments, the 5' end of the oligonucleotide abuts the 3' end of a 
nucleotide in the foundation probe array. 

In some embodiments, the oligonucleotide can be ligated to the foundation probe 

array. 

In some embodiments, a sample of nucleic acids is hybridized to the capture 
probe array and nucleic acids in the sample which hybridize to the second sequence in the 
oUgonucleotide are identified. In some embodiments, the 3' terminus of the hybridizing 
sample nucleic acid is extended using a DNA polymerase and sequences in the capture 
probe array as a template. 
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In some embodiments, the sample nucleic acids are labeled. 
The invention also features a kit for analyzing a sample of nucleic acid molecules. 
The array includes: 

a foundation probe array comprising foundation probes having a defined 

sequence; 

an oligonucleotide comprising a first and second sequence, wherein the first 
sequence is complementary to a defined sequence in a foundation probe in the foundation 
probe array and the second sequence is complementary to a target sequence. 

Methods of the invention can also be used to capture RCA products containing 
multiplex tags. Accordingly, in one aspect, the invention includes a method of analyzing 
a polynucleotide, e.g., detecting a genetic event, e.g., a single nucleotide polymorphism, 
in a sample. The method includes: 

providing a sample which includes a sample polynucleotide sequence to be 
analyzed; 

(2) (a) annealing an effective amount of sample sequence to a single-stranded 
circular template to yield an annealed circular template, wherein the single-stranded 
circular template comprises (i) at least one copy of a nucleotide sequence complementary 
to the sequence of the sample sequence and optionally, (ii) at least one nucleotide 
effective to produce a cleavage site in an oligonucleotide multimer; 

(b) providing the primed circular template with effective amounts of a primer, 
at least two types of nucleotide triphosphates and a polymerase enzyme, to yield a single- 
stranded oligonucleotide multimer complementary to the circular oligonucleotide 
template, wherein the oligonucleotide multimer comprises multiple copies (amplified) of 
the sample sequence; optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the 
cleaved amplified sample nucleic acid; and 

(3) analyzing the sample sequence firom (2) (b) or (c), e.g., by providing an 
array of a plurahty of capture probes, wherein each of the capture probes is positionally 
distinguishable fi-om other capture probes of the plurality on the array, and wherein each 
positionally distinguishable capture probe of the plurahty includes a unique (i.e., not 
repeated in another capture probe) region; and hybridizing the amplified sample sequence 
with the array of capture probes, thereby analyzing the sample sequence. 

In preferred embodiments, the amplified sequence fi-om step 2 of the method can 
be fiirther ampUfied, e.g., ampUfied by rolling circle, e.g., prior to analysis under step 3. 
In such embodiments, the amplified sample nucleic acid firom step 2, e.g., a cleaved 
amphfied sample nucleic acid, can be ampUfied fixrther. The second or other subsequent 
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rolling circle amplification can use a circular oligonucleotide probe of the same or similar 
sequence that used as Step 2, or one of a different sequence. It is also possible that the 
circular oligonucleotide in a second or subsequent rolling circle amplification, can be, for 
example, closed or open circular template. 
5 In preferred embodiments, the circular oligonucleic template (of any step) is 

prepared by a process comprising the steps of: 

(a) hybridizing each end of a linear precursor oligonucleotide to a single 
positioning oligonucleotide, e.g., a sample sequence, having a 5' nucleotide sequence 
complementary to a portion of the sequence comprising the 3' end of the linear precursor 

10 oligonucleotide and a 3* nucleotide sequence complementary to a portion of the sequence 
comprising the 5' end of the linear precursor oligonucleotide, to yield an open 
oUgonucleotide circle wherein the 5' end and the 3' end of the open circle are positioned 
so as to abut each other; and 

(b) joining the 5' end and the 3' end of the open oligonucleotide circle to yield a 
1 5 circular oligonucleotide template. Rolling circle ampUfication can be primed by the 

positioning oUgonucleotide, e.g., the target nucleic acid, or by another primer, in this or 
other methods disclosed herein. 

In preferred embodiments, analyzing a nucleic acid includes, e.g., sequencing the 
nucleic acid, e.g., by sequencing by hybridization or positional sequencing by 
20 hybridization, detecting the presence of, or identifying, a genetic event, e.g., a SNP, in a 
target nucleic acid, e.g., a DNA. 

In preferred embodiments, the genetic event is within 1, 2, 3, 4 or 5 base pairs 
fi-om the end of the target molecule, or is sufficiently close to the end of the target 
molecule that a mismatch would inhibit DNA polymerase-based extension firom a target/ 
25 primed circle. In preferred embodiments the inhibition is at least 50, 75, 90 or 99%. 

In preferred embodiments, the target is amplified, e.g., by a isothermal or 
nonisothermal method, e.g., by PGR, prior to contact with a circular template. 

In preferred embodiments the circular template includes a site for a type IIS 
restriction enzyme and the site is positioned, e.g., such that a type IIS restriction binding 
30 at the site cleaves adjacent the region which binds the sample sequence or cleaves in the 
region which binds the sample sequence. 

In a preferred embodiment a region of the circular template is complementary to a 
genetic event, e.g., a mutation or SNP, and hybridizes effectually to sample nucleic acid 
having the event and sample nucleic acid not having the event. 



G:\DATA\PLM\PKP\application.doc 




In preferred embodiments, each of the capture probes has a binding region for a 
non-specific endonuclease binding site, e.g., a type IIS restriction enzyme binding site, 
and the method includes: 

hybridizing the single stranded target nucleic acid with the capture probe array, 
5 (preferably the region of an amplification product which corresponds to the genetic event 
hybridizes with the variable region of a capture probe); 

(optionally) ligating the single stranded target nucleic acid to a strand of the 
capture probe; 

cleaving the single stranded target nucleic acid/capture probe duplex with a non- 
10 specific endonuclease, to form a cleaved single stranded target nucleic acid/capture probe 
duplex, such that a base corresponding to the genetic event is in the single stranded 
region formed by the cleavage; 

extending along the single strand which contains the genetic event with one and 
preferably with 2, 3, or all 4 labeled chain terminating nucleotides, wherein if more than 
1 5 one labeled chain terminating nucleotide is used each of the chain terminators, e.g., A or 
C, are distinguishable, such that the incorporation of a chain terminator indicates the 
presence of a genetic event, 

thereby detecting or identifying a genetic event in a target nucleic acid. 
In preferred embodiments the polynucleotide sequence is: a DNA molecule: all or 
20 part of a known gene; wild type DNA; mutant DNA; a genomic fragment, particularly a 
human genomic fragment; a cDNA, particularly a human cDNA. 

In preferred embodiments the polynucleotide sequence is: an RNA molecule: 
nucleic acids derived from RNA transcripts; wild type RNA; mutant RNA, particularly a 
human RNA. 

25 In preferred embodiments the polynucleotide sequence is: a human sequence; a 

non-human sequence, e.g., a mouse, rat, pig, primate. 

In preferred embodiments the method is performed: on a sample from a human 
subject; and a sample from a prenatal subject; as part of genetic counseling; to determine 
if the individual from which the target nucleic acid is taken should receive a drug or other 
30 treatment; to diagnose an individual for a disorder or for predisposition to a disorder; to 
stage a disease or disorder. 

In preferred embodiments the capture probes are single stranded probes in an 

array. 

In preferred embodiments the capture probes have a structure comprising a double 
35 stranded portion and a single stranded portion in an array. 
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In preferred embodiments hybridization to the array is detected by mass 
spectrophotometry, e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments probes are selected for minimal cross hybridization 
with other probes. 

5 In preferred embodiments the amplified sample sequence has attached thereto a 

first member of a proximity detector pair and hybridization to the array allows the first 
member to be brought into proximity with a second member to provide a signal. 

In a preferred embodiment the amplified sample sequence which hybridizes to a 
capture probe, or the capture probe, is the substrate of or template for an enzyme 

1 0 mediated reactions. For example, after hybridization to the capture probe, the amplified 
sample sequence is ligated to the capture probe, or after hybridization it is extended along 
the capture probe. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., an amplified sample sequence, 

1 5 a capture probe, a sequence to be analyzed, or a molecule which hybridizes thereto, is the 
substrate or template for the enzyme mediated reaction. The enzyme mediated reaction 
can be: an extension reaction, e.g., a reaction catalyzed by a polymerase; a linking 
reaction, e.g., a Ugation, e.g., a reaction catalyzed by a hgase; or a nucleic acid cleavage 
reaction, e.g., a cleavage catalyzed by a restriction enzyme, e.g., a Type IIS enzyme. The 

20 ampUfied sample sequence which hybridizes with the capture probe can be the substrate 
in an enzyme mediated reaction, e.g., it can be ligated to a strand of the capture probe or 
it can be extended along a strand of the capture probe. Alternatively, the capture probe 
can be extended along the hybridized ampUfied sample sequence. (Any of the extension 
reactors discussed herein can be performed with labeled, or chain terminating, subunits.) 

25 The capture probe duplex can be the substrate for a cleavage reaction. These reactions 
can be used to increase specificity of the method or to otherwise aid in detection, e.g., by 
providing a signal. 

Methods such as those described in U.S. Patent Nos. 5,503,980 or 5,631,134, both 
of which are hereby incorporated by reference, can be used in methods of the invention. 
30 In particular, the array and array-related steps recited herein can use methods taught in 
these patents. 

In preferred embodiments, the method includes: 
providing an array having a pluraUty of captiire probes, wherein each of the capture 
probes is a) positionally distinguishable firom the other capture probes of the pluraUty 
35 and has a imique variable region (not repeated in another capture probe of the plurality), 
b) has a variable region capable of hybridizing adjacent to the genetic event; and c) has a 
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3* end capable of serving as a priming site for extension hybridizing the amplified sample 

sequence having a genetic event to a capture probe of the array, (preferably the region of 

the amplified sample sequence having a genetic event hybridizes adjacent to the variable 

region of a capture probe); and using the 3* end of the capture probe to extend across the 

region of genomic nucleic acid having a genetic event with one or more terminating base 

species, where if more than one is used each species has a unique distinguishable label 

e.g. label 1 for base A, label 2 for base T, label 3 for base G, and label 4 for base C; 

thereby analyzing the amplified sample sequence. 

The nucleic acids, e.g., probes and primers, arrays, and other reagents or devices 

disclosed herein are also within the invention. 

In another aspect, the invention features a probe, e.g., a probe described herein, 

which is useful for rolling circle amplification. The probe includes, preferably in the 
orderofS' to3': 

a first region at one terminus of the probe, which includes a sequence which 
hybridizes to a first region of a target nucleic acid sequence; 

a second region which includes a cleavage site, e.g., a site for cleavage by a 
restriction enzyme sequence, e.g., a type lis enzyme and a capture tag sequence; 

a third region which includes a sequence complementary to a universal primer 
sequence; and 

a fourth region, at the other terminus of the probe, which includes a sequence 
which hybridizes to a second region of a target sequence. 

The total length of the probe can range, e.g., between 10 to 500 nucleotides. 
In one example, the first region of the probe can range in length from between about 9 to 
50, or between 10-15 nucleotides in length. The capture tag sequence within the second 
region of the primer can range in length between 2-100, preferably 4-20, more preferably 
4-12 and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site within the 
second region can range between 5-40 nucleotides in length. The third region of the 
sequence can range between 6-100, preferably 7-50, more preferably 9-30 and most 
preferably 10-25 and 12-20 nucleotides in length. The fourth region of the primer can 
range in length from between about 9 to 50, or between 10-15 nucleotides in length. 

Capture tag sequences should be designed to minimize secondary structure and to 
promote efficient hybridization, e.g., to an array of capture probes. In preferred 
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embodiments a capture tag is a sequence not found at a teraiinus, and more preferably not 
found in the target sequence. A sequence complementary to the universal primer should 
be designed such that the universal primer can hybridize to and direct the amplification of 
a target nucleic acid sequence. 
5 The invention also features, a plurality of rolling circle amplifying probes (a set of 

probes), wherein each probe of the plurality of probes includes a unique capture tag 
sequence. An integration site nucleotide can differ between pairs, e.g., a pluraUty can 
include pairs with 2, 3, or all of a, g, c, t as an interrogation site nucleotide. 

The total length of the primer can range between 10 to 400 nucleotides. In one 
10 example, the first region of the probe can range in length from between about 9 to 30, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region 
of the probe can range in length between 2-100, preferably 4-20, more preferably 4-12 
and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
between 5-40 nucleotides in length. The third region of the sequence can range between 
15 6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 

nucleotides in length. The fourth region of the probe can range in length from between 
about 9 to 50, or between 10-15 nucleotides in length 

Each probe in the plurality should include a unique capture tag sequence so as to 
allow separate analysis. Thus, the capture tags will differ by at least 1 and preferably at 
20 at least 2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the sequence of 
capture tag in each of the plurahty of probes will be sufficiently different that a sequence 
tag will not hybridize, under assay conditions used, to a sequence which is 
complementary to another capture tag in the set, in other words, they will not cross 
hybridize. Capture tag sequences can also be designed to minimize secondary structure, 
25 to promote efficient hybridization, e.g., to an array of capture probes. In preferred 

embodiments a capture tag is a sequence not found at a terminus, and more preferably not 
found in the target sequence. A set of probes which amplify the sense and antisense 
strand of a target can have the same or different capture tag sequences. 

The probes can be used to detect a genetic event, e.g., a mutation or an SNP, in a 
30 target sequence. The nucleotide complementary to the genetic event or to a nucleotide of 
a genetic event, can be degenerate in the plurality. The plurality of probes can differ 
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from each other at the end terminus of the probe. The probes should include at their 
interrogation site, or at the end close to their interrogation site a degenerate nucleotide 
which allows for ligation or polymerase extension, only when hybridized to a specific 
target. 

The invention also features a pair of primers/probes, e.g., PGR primers, which 
allow for the extension, e.g., amplification, of a particular nucleotide sequence. 
Accordingly, the invention features a pair of primers including: 

(a) a first primer which includes, preferably in the order of 5' to 3': 
a first region which includes a sequence complementary to a universal primer 
sequence; 

a second region which includes a cleavage site, e.g., a site for cleavage by a 
restriction enzyme sequence, e.g., a type lis enzyme and a capture tag sequence, wherein 
the capture sequence has preferably a different sequence than any other capture sequence; 

a third region which includes a sequence which hybridizes to a first target 
sequence; and 

(b) a second primer which includes, preferably in the order of 5' to 3 ' : 
a first region which includes a sequence complementary to a universal primer 
sequence; 

a second region which includes a cleavage site, e.g., a site for cleavage by a 
restriction enzyme sequence, e.g., a type lis enzyme and a capture tag sequence, wherein 
the capture sequence has the same or different sequence than the capture sequence of the 
first primer; and 

a third region which includes a sequence which hybridizes to a second target 
sequence. 

The total length of the primer can range between 10 to 200 nucleotides. In one 
example, the first region of the primer can range in length from between about 9 to 30, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region 
of the primer can range in length between 2-100, preferably 4-20, more preferably 4-12 
and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
between 5-40 nucleotides in length. The third region of the sequence can range between 
6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
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nucleotides in length. 

Each probe in the plurality should include a unique capture sequence tag so as to 
allow separate analysis. Thus, the capture tags will differ by at least 1 and preferably at 
least 2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the sequence of the 
5 capture tag in each of the plurality of probes will be sufficiently different that a sequence 
tag will not hybridize, under assay conditions used, to a sequence which is 
complementary to another capture tag in the set, in other words, they will not cross 
hybridize. Capture tag sequences can also be designed to minimize secondary structure, 
to promote efficient hybridization, e.g., to an array of capture probes. In preferred 
10 embodiments a capture tag is a sequence not found at a terminus, and more preferably not 
found in the target sequence. A set of probes which amplify the sense and antisense 
strand of a target can have the same or different capture tag sequences. 

In one embodiment, the target sequences can be different regions on the same 
molecule. In another embodiment, the target sequences can be on different molecules. 
1 5 The invention also features a pluraUty (or set) of pairs of primers, e.g., PGR 

primers, wherein each pair of the plurality includes a unique sequence tag. An 
integration site nucleotide can differ between pairs, e.g., a plurality can include pairs with 
2, 3, or all of a, g, c, t as an interrogation site nucleotide. 

The total length of the primer can range, e.g., between 10 to 500 nucleotides. In 
20 one example, the first region of the primer can range in length from between about 9 to 
30, or between 10-15 nucleotides in length. The capture tag sequence within the second 
region of the primer can range in length between 2-100, preferably 4-20, more preferably 
4-12 and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
between 5-40 nucleotides in length. The third region of the sequence can range between 
25 6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
nucleotides in length. 

A capture tag sequence in each pair of the plurality of pairs should each be unique 
so as to allow separate analysis. Thus, the capture tags will differ by at least 1 and 
preferably at at least 2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the 
30 sequence of the capture tag in each of the plurality of probes will be sufficiently different 
that a sequence tag will not hybridize, under assay conditions used, to a sequence which 
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is complementary to another capture tag in the set, in other words, they will not cross 
hybridize. Capture tag sequences can also be designed to minimize secondary structure, 
to promote efficient hybridization, e.g., to an array of capture probes. In preferred 
embodiments a capture tag is a sequence not found at a terminus, and more preferably not 
found in the target sequence. A set of probes which amplify the sense and antisense 
strand of a target can have the same or different capture tag sequences. In a pair of 
probes the capture tag sequences can differ or they can be identical. 

The probes can be used to detect a genetic event, e.g., a mutation or an SNP, in a 
target sequence. The nucleotide complementary to the genetic event or to a nucleotide of 
a genetic event, can be degenerate in the plurality. The plurality of probes can differ 
from each other at the end terminus of the probe. The probes should include at their 
interrogation site, or at the end close to their interrogation site a degenerate nucleotide 
which allows for ligation or polymerase extension, only when hybridized to a specific 
target. 

The invention also features a pair of primers/probes, e.g., ligation or PGR primers, 
which allow for the extension, e.g., ampUfication, of a particular nucleotide sequence 
including: 

(a) a first primer in the order of 5' to 3 'having: 

a first region which includes a sequence complementary to a universal primer 
sequence; 

a second region which includes a capture tag sequence, wherein the capture 
sequence has preferably a different sequence than any other capture sequence, and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme sequence, e.g., a type lis 
enzyme; 

a third region which includes a sequence which hybridizes to a first target 

sequence; and 

(b) a second primer in the order of 5' to 3 'having: 

a first region which can hybridize to a second region on the target nucleic acid 
sequence; 

a second region which includes a capture tag sequence, wherein the capture 
sequence has preferably a different sequence than any other capture sequence, and a 
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cleavage site, e.g., a site for cleavage by a restriction enzyme sequence, and 

a third region which includes a sequence complementary to a universal primer 
sequence. 

The total length of the primer can range between 10 to 200 nucleotides. In one 
example, the first region of the primer can range in length from between about 9 to 30, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region 
of the primer can range in length between 2-100, preferably 4-20, more preferably 4-12 
and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
between 5-40 nucleotides in length. The third region of the sequence can range between 
6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
nucleotides in length. 

A capture tag sequence in each primer should each be unique so as to allow 
separate analysis. Thus, they will differ by at least 1 and preferably at least 2, 3, 4, 5, 6, 
10, or 20. In preferred embodiments the sequence of the capture tag in each of the 
plurality of probes will be sufficiently different that a sequence tag will not hybridize, 
under assay conditions used, to a sequence which is complementary to another capture 
tag in the set, in other words, they will not cross hybridize. Capture tag sequences can 
also be designed to minimize secondary structure, to promote efficient hybridization, e.g., 
to an array of capture probes. In preferred embodiments a capture tag is a sequence not 
found at a terminus, and more preferably not found in the target sequence. A set of 
probes which ampUfy the sense and antisense strand of a target can have the same or 
different capture tag sequences. 

In one embodiment, the target sequences can be different regions on the same 
molecule. In another embodiment, the target sequences can be on different molecules. 

The invention also features a plurality of pairs of primers, e.g., ligation or PGR 
primers, which allow for the extension, e.g., amplification, of a set of target nucleotide 
sequences, wherein each of the pairs includes a unique sequence tag. An integration site 
nucleotide can differ between pairs, e.g., a plurality can include pairs with 2, 3, or all of a, 
g, c, t as an interrogation site nucleotide. 

The total length of the primer can range, e.g., between 10 to 500 nucleotides. In 
one example, the first region of the primer can range in length from between about 9 to 
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30, or between 10-15 nucleotides in length. The capture tag sequence within the second 
region of the primer can range in length between 2-100, preferably 4-20, more preferably 
4-12 and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
between 5-40 nucleotides in length. The third region of the sequence can range between 

5 6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
nucleotides in length. 

Each pair should include a unique capture tag sequence so as to allow separate 
analysis. Thus, they will differ by at least 1 and preferably at at least 2, 3, 4, 5, 6, 10, or 
20. In preferred embodiments the sequence of the capture tag in each of the plurality of 

10 probes will be sufficiently different that a sequence tag will not hybridize, under assay 
conditions used, to a sequence which is complementary to another capture tag in the set, 
in other words, they will not cross hybridize. Capture tag sequences can also be designed 
to minimize secondary structure, to promote efficient hybridization, e.g., to an array of 
capture probes. In preferred embodiments a capture tag is a sequence not found at a 

1 5 terminus, and more preferably not found in the target sequence. A set of probes which 
amplify the sense and antisense strand of a target can have the same or different capture 
tag sequences. In a pair of probes the capture tag sequences can differ or they can be 
identical 

In one embodiment, the target sequences can be different regions on the same 
20 molecule. In another embodiment, the target sequences can be on different molecules. 

The probes can be used to detect a genetic event, e.g., a mutation or an SNP, in a 
target sequence. The nucleotide complementary to the genetic event or to a nucleotide of 
a genetic event, can be degenerate in the plurality. The plurality of probes can differ 
from each other at the end terminus of the probe. The probes should include at their 
25 interrogation site, or at the end close to their interrogation site a degenerate nucleotide 
which allows for ligation or polymerase extension, only when hybridized to a specific 
target. 

The invention also features a pair of probes for invader-directed cleavage, 
wherein the invader-directed cleavage pair includes: 
30 (a) an invader probe including a region which hybridizes to a first region on the 

target sequence; and 
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(b) a signal probe including, preferably in the order of 5' to 3 'including: 
optionally a signal sequence; 
a capture tag sequence; and 

a region which can hybridize to a second region on the target nucleic acid 
5 sequence. 

The total length of the probe can range between 10 to 400 nucleotides. In one 
example, the capture tag sequence probe can range in length between 2-100, preferably 4- 
20, more preferably 4-12 and most preferably 4-8 and 6-8 nucleotides in length. 

The capture tag sequences in the plurality of the probes should each be unique so 
10 as to allow separate analysis. Thus, they will differ by at least 1 and preferably at at least 
2, 3, 4, 5, 6, 10, or 20. In preferred embodiments the sequence of the capture tag in each 
of the plurality of probes will be sufficiently different that a sequence tag will not 
hybridize, under assay conditions used, to a sequence which is complementary to another 
capture tag in the set, in other words, they will not cross hybridize. Capture tag 
15 sequences can also be designed to minimize secondary structure, to promote efficient 

hybridization, e.g., to an array of capture probes. In preferred embodiments a capture tag 
is a sequence not found at a terminus, and more preferably not found in the target 
" sequence. A set of probes which amplify the sense and antisense strand of a target can 

^ have the same or different capture tag sequences. In a pair of probes the capture tag 

^ 20 sequences can differ or they can be identical. 

S In one embodiment, the target sequences can be different regions on the same 

molecule. In another embodiment, the target sequences can be on different molecules. 

The invention also features a plurality of pairs of probes for the invader-directed 
cleavage, wherein each pair of the plurality includes a unique sequence tag. An 
25 integration site nucleotide can differ between pairs, e.g., a plurality can include pairs with 
2, 3, or all of a, g, c, t as an interrogation site nucleotide. 

The total length of the probes can range between 10 to 400 nucleotides. The 
capture tag sequence can range in length between 2-100, preferably 4-20, more preferably 
4-12 and most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range 
30 between 5-40 nucleotides in length. The third region of the sequence can range between 
6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
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nucleotides in length. 

Each probe of the plurality should include a unique capture tag sequence so as to 
allow separate analysis. Thus, capture tags will differ by at least 1 and preferably at at 
least 2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the sequence of the 
5 capture tag in each of the plurality of probes will be sufficiently different that a sequence 
tag will not hybridize, under assay conditions used, to a sequence which is 
complementary to another capture tag in the set, in other words, they will not cross 
hybridize. Capture tag sequences can also be designed to minimize secondary structure, 
to promote efficient hybridization, e.g., to an array of capture probes. In preferred 
10 embodiments a capture tag is a sequence not found at a terminus, and more preferably not 
found in the target sequence. A set of probes which amphfy the sense and antisense 
strand of a target can have the same or different capture tag sequences. In a pair of 
probes the capture tag sequences can differ or they can be identical 

In one embodiment, the target sequences can be different regions on the same 
1 5 molecule. In another embodiment, the target sequences can be on different molecules. 
The invention further includes the derivative nucleic acid sequences or the 
plurality of derivative nucleic acid sequences described herein. For example, the 
invention includes the derivative nucleic acid sequence including one or more capture 
tags and a target nucleic acid sequence, and optionally a signal molecule. Preferably, the 
20 capture tag is at one or both of the ends of the nucleic acid sequence. In one example, the 
nucleic acid sequence is a double stranded molecule including a 3 'overhang which 
includes the capture tag. In another example, the nucleic acid sequence is a double 
stranded molecule including a 5 'overhang which includes the capture tag. In yet another 
example, the nucleic acid is a single stranded sequence including a capture probe at its 5' 
25 and 3' termini. 

A capture tag sequence is a sequence which allows identification of the probe or 
derivative nucleic acid of which it is a part. A capture tag sequence is preferably 2-100, 
more preferably 4-20, more preferably 4-12, and most preferably 4-8 or 6-8, nucleotides 
in length. Probes or derivative nucleic acids which include capture tags are generally 
30 used in sets, i.e., more than one is used, so that multiplexed analysis can be performed. 
The sequence of capture tags on the molecules of a set should each be unique so as to 
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allow separate analysis. Thus, they will differ at at least 1 and preferably at at least 2, 3, 
4, 5, 6, 10, or 20. In preferred embodiments the sequence of the capture tag in each 
molecule of a set will be sufficiently different that a sequence tag will not hybridize, 
under assay conditions used, to a sequence which is complementary to another capture 

5 tag in the set, in other words, they will not cross hybridize. Capture tag sequences can 
also be designed to minimize secondary structure, to promote efficient hybridization, e.g., 
to an array of capture probes. In preferred embodiments a capture tag is a sequence not 
found at a terminus, and more preferably not found in the target sequence. A set of 
primers which amplify the sense and antisense strand of a target can have the same or 

1 0 different capture tag sequences. 

A universal primer is one which can hybridize to and direct the amplification of a 
plurality of different nucleic acids, e.g., a plurality of different derivative nucleic acids of 
a plurality of nucleic acids having different capture tags. 
^ jl^Y^A ^rivativo nucleic acid is a nucleic acid whi c l^^ 



K^/^ 15 atone or both of its termini in a single-str^iSoverhang. The derivative sequence is 



produced only uporphe-ttymdization of a probe which includes the sequence tag as an 

inlemaHrS^ien t hybri d i re S- t^ th^ ^^r^ - ^ngunnno . fnr whir . h it i r, r . pGGific . 

The invention provides methods for the multiplexed analysis of complex mixtures 
of nucleic acids by the generation and capture of a plurality of species of derivative 

20 nucleic acid molecules, the generation of which is dependent on the presence of specific 
nucleic acid molecules in a sample. A pluraUty of derivative nucleic acid molecules is 
generated for each species of target nucleic acid molecule to be analyzed in the sample. 
The nucleic acid sequences at one terminal region of the derivative nucleic acid 
molecules are "arbitrary" tag sequences that are present initially as internal sequences in 

25 probes used to prepare the derivative nucleic acids. The derivative nucleic acid molecules 
are analyzed, e.g., captured and sorted by contiguous base stacking hybridization of their 
terminal tag regions to single-stranded overhangs of a plurality of spatially separated 
partially duplex probes. Hybridization of the tag sequences on the derivative nucleic acid 
molecules to the partially duplex probes is very specific due to the use of an optimized set 

30 of relatively short sequences in the single-stranded overhangs. The specificity may be 

enhanced by a requirement for Ugation or polymerization reactions for successful capture. 
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The invention is useful for sorting or demultiplexing the products of multiplexed 
amplification reactions or assays on microarrays or beads. 

Preferred embodiments feature tag sequences which are not found in the target 
nucleic acid sequences. This results in a more efficient, more specific assay, particularly 
5 when the tags in a reaction are chosen so as to minimize cross hybridization, i.e., 
hybridization of a first tag to a sequence complementary to a second tag. As the 
sequence tags need not and preferable are not able to hybridize to the tags under 
conditions under which the assay is performed, they can be tailored to provide efficient 
specific hybridization, e.g., to a capture array. 
10 ^n;?^ The captnrp tags usH in ffl^tfW^^-^^^"^ invenhnn are imnallv supplied ashUciTt 
A ciequences, i.e., they are flanked on both sides with other sequenc^^ i target 

sequence specific reaction are tags found at ttiejermnlusofa^ acid. The internal 
/- positioning makes it very difficultioFilrf^^ tag-continuing probes to hybridize to a 
capture probe. On thep^itf^ reacted molecules, which present the tag sequence at a 
1 5 terminus, hybridizereadily with capture probes, particularly partially duplex probes 
mg^_f\\n s}^ ctranrLcm erh fln g w h i r.h if . com plen ieiilaiv with a tag sequence -^ 

Methods of the invention are usefiil for many appUcations including 
quantification of the expression levels of specific genes, detection of the allelic variants 
of multiple polymorphic sites within individuals, screening for genetic disease, 
20 pharmacogenomic analysis to optimize drug therapy, detection of infectious agents and 
variants of these agents, and forensic analysis to identify human or animal species or 
individuals. 

Methods described herein provide for the multiplexed amplification and/or 
analysis of nucleic acid target sequences by the specific capture of molecules that are 
25 derived fi-om probe molecules ("target-specific probes") that interact with target 

molecules and that are encoded with tag sequences unrelated to the target sequences. The 
tag sequences are initially present internally in the target-specific probes. As a result of 
specific interaction of the target-specific probes with the nucleic acid target molecules, 
these tag sequences will be present at the termini of molecules derived firom the target- 
30 specific probes ("derivative nucleic acids"). 
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The enhanced discrimination of partially duplex probes is exploited for the highly 
specific capture and sorting of the derivative nucleic acids. Target-specific probe 
molecules that have not interacted in specific ways with target sequences are not captured 
since the sequence tags are not present on their termini. Thus the microarrays of generic 
probes are used to effectively demultiplex multiplexed solution phase reactions that 
produce derivative nucleic acids. 

Partially duplex arrays can be used in two ways: 1) to capture and sort amplified 
target sequences resulting from multiplexed amplification reactions such as PGR or RCA, 
and 2) to capture and sort the reaction products of multiplexed assays such as Invader 
assays. 

Derivative nucleic acid can be partitioned by specific hybridization and 
immobilized by sequence specific-recognition onto an array element containing 
oUgonucleotide(s) packaged into a device. Each array element can specifically capture 
cleaved fragments arising from a single initial recognition event representing the target 
nucleic acid in the original sample. The signal intensity detected at each element can 
then reflect the relative abundance of the target molecule relative to other nucleic acid 
targets or alleles in the original sample. Thus, a multitude of array elements can capture a 
multitude of cleaved fragment species. 

Methods of the invention provide for multiplexing nucleic acid amplification and 
detection processes. This allows for the simultaneous analysis of a large number of 
nucleic acid fragments present in the sample. In the final detection step, a specific 
quantitative signal is detected for each allele or nucleic acid fragment present in the 
sample. Spatial separation of the amplification targets into distinct array elements 
eliminates competition of simultaneously amplifying targets and improves the fideUty 
and yield of the overall process. Further, the spatial separation provides a direct means of 
marking ampUfication targets that yield insufficient or no signals due to failed 
amplification reactions that potentially arise from failure of priming events or 
mispriming. These steps minimize generation of ambiguous data and facilitates 
troubleshooting attempts. 

Also described are a variety of detection methods for qualitative and quantitative 
determination of each nucleic acid or allele present in the sample. Detection methods can 
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rely on, e.g., fluorescent signals, redox pairs, electronic detection, or enzymatic reactions. 

Devices for miniaturization and parallel analysis of nucleic acids in a sample can 
include, e.g., microplates, acrylamide gel pads, flow-through chips, and or other supports. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. All publications, patent applications, 
patents, and other references mentioned herein are incorporated by reference in their 
entirety. In case of conflict, the present specification, including definitions, will control. 
In addition, the materials, methods, and examples are illustrative only and not intended to 
be limiting. 

Other features and advantages of the invention will be apparent firom the 
following detailed description, and fi-om the claims. 
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Detailed Description 

Description of Figures 

Figure 1 is a diagram of the elements of primer adapters for multiplex PGR 
5 amplification with universal primer sequences and the capture and sorting of the 
derivative nucleic acids on partially duplex arrays. 

Figure 2 is an illustration of the protocol for multiplex PGR with two target- 
specific primers incorporating universal primer sequences, Type IIS restriction sites and 
generic capture tags, followed by capture of one or both target strands of the derivative 
10 nucleic acids. 

Figure 3 is a diagram of the elements of 5' and 3' probe/primers that are ligatable 
^ to a specific nucleic acid sequence and that can be amplified by PGR with universal 

.0 primers. 

M= Figure 4 is an illustration of the protocol for target-dependent ligation and PGR 

sj 1 5 amplification of contiguous probes incorporating universal primer sequences, Type IIS 
^ restriction sites and generic capture tags, followed by capture of one or both strands of 

£ the derivative nucleic acids. 

Figure 5 is a diagram of the elements of circularizable probes containing universal 
primer sequences. Type IIS restriction sites and generic capture tags that can be used for 
20 target-dependent ligation and ampUfication, followed by capture of one or both strands of 
the derivative nucleic acids. 

Figure 6 is an outiine of the Gleavase Invader assays (Third Wave Technologies) 
showing the elements of flap sequences that may be used for capture and detection on 
partially duplex probes. 

25 Partially Duplex Capture Probes 

Methods disclosed herein are performed with a plurality of partially duplex 
probes, each with a unique sequence in its region of single-stranded overhang. The 
overhangs may be 3' or 5' overhangs, depended on the type of molecule that is to be 
captured. 
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The partially duplex probes may be formed from two nucleic acid segments that 
are covalently linked and which form a partially self-complementary hairpin structure 
(see, e.g., US patent 5,770,365, hereby incorporated by reference). A chemical moiety 
such as an amine group may be present on the partially duplex probe, preferably on the 
5 loop region of the hairpin, to effect covalent inmiobilization to a substrate. 

The partially duplex probes may also be formed from two nucleic acid segments 
that are bound by non-covalent interactions, e.g., by hydrogen bonding of complementary 
base pairs. A chemical moiety such as an amine group may be present somewhere on one 
of the nucleic acid segments to effect covalent immobilization to a substrate. It is 
10 preferred that the binding of the two nucleic acid segments be adequately stable so that 
they form stable complexes in solution, and hence may be utilized as a preformed 
reagent. Making the regions of complementary long enough to be stable at the 
temperatiu-es and solution conditions in which they will be used will effect stable 
interactions between the segments. Incorporation of nucleic acid analogs such as peptide 
J1 1 5 nucleic acids that enhance the stability of nucleic acid hybrids will also enhance the 

stability of the complexes. Attaching binding moieties to the segments can also be used to 
effect stable interactions between the segments. For example, biotin can be added to each 
of the segments, and the segments can be combined in approximately equimolar amounts 
and allowed to form hybrids in solution. A dimeric antibody to biotin can then be added 
20 to stabilize the hybrid. The antibody can also be used to immobilize the partially duplex 
probe. 

In one preferred embodiment the partially duplex probes are immobilized on 
positionally distinguishable elements of a microarray, which may be created on a non- 
porous substrate or on a three dimensional porous substrate. The three dimensional 
25 porous substrate may consist of a hydrophilic polymer matrix, such as derivatized 
polyacrylamide, to which the probes are covalently attached. 

In another preferred embodiment the partially duplex probes are immobiUzed on 
different beads that may be distinguished by an analytical means such the spectral 
properties of electromagnetic absorption or fluorescence emission. 



m 
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The single-stranded overhangs on the partially duplex probes may be from four 
the twenty bases long. A set of single stranded probe sequences is selected from all the 
possible sequences in the overhangs - e.g., 1024 for five base overhangs or 4096 for six 
base overhangs. Probe sequences are selected to provide highly specific and efficient 
capture with minimal cross-reaction between the probes. The sequences of the single- 
stranded probe overhang sequences are unrelated to target sequences in the samples to be 
analyzed. Rather, the overhang sequences are complementary to the capture tag 
sequences that are initially present internally in probe molecules ("target-specific 
probes") that interact with the target nucleic acids in solution. As a result of specific 
interaction of the target-specific probes with the nucleic acid target molecules, these tag 
sequences will be present at the termini of molecules derived from the target-specific 
probes ("derivative nucleic acids"). 

Suitable restriction enzymes for use in the invention are listed in the table below. 



Enzyme 


Type of 
overhang 


# bases in 
overhang 


ApaBI 


3' 


5 


Bael 


3' 


5 


Bbr7l 


5" 


4 


Bpll 


3" 


5 


Bsal 


5' 


4 


BsmBI 


5' 


4 


Bsp24l 


3" 


5 


BstXI 


3' 


4 


Bbvl 


5" 


4 


BsmAI 


5' 


4 


BsmFI 


5' 


4 


Cjel 


3' 


6 


CjePI 


3' 


6 


Fokl 


5' 


4 


HaelV 


3' 


6,5 


Hgal 


5' 


5 


Sth132l 


5" 


4 


StsI 


5' 


4 



Positional Arrays 

Positional arrays suitable for the present invention include high and low density 
arrays on a two dimensional or three dimensional surface. Positional arrays include 
nucleic acid molecules, peptide nucleic acids or high affinity binding molecules of known 
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sequence attached to predefined locations on a surface. Arrays of this nature are 
described in numerous patents which are incorporated herein by reference. These 
include, e.g.. Cantor, U.S. Patent No. 5,503,980; Southern, EP 0373 203 Bl; Southern, 
U.S. Patent No. 5,700,637 and Deugau, U.S. Patent No. 5,508,169. The density of the 
5 array can range from a low density format, e.g., a microliter plate, e.g., a 96- or 384- well 
microliter plate, to a high density format, e.g. 1000 molecules/cm^, as described in, e.g., 
Fodor, U.S. Patent No. 5,445,934. 

The surface on which the arrays are formed can be two dimensional, e.g., glass, 
plastic, polystyrene, or three dimensional, e.g. polymer gel pads, e.g. polyacrylamide gel 
1 0 pads of a selected depth, width and height. 

In preferred embodiments, the target or probes bind to (and can be eluted from) 
the array at a single temperature. This can be effected by manipulating the length or 
concentration of the array or nucleic acid which hybridizes to it, by manipulating ionic 
strength or by providing modified bases. 



15 



Gel Pad Arrays 

Array-based method described herein can be practiced on gel pad arrays. Gel 
pads, including arrays of gel pads, can be prepared by a variety of methods, some of 
which are known in the art. Examples of these methods are provided in, e.g., Timofeev 

20 et al. Nucleic Acids Research (1996), Vol. 24, 3142-3148; Drobyshev et al. Gene (1997) 
188: 45-52; Livshits et al, BiophysicalJournal (1996) 71:2795-2801; Yershov et al, 
Proc, Natl Acad. Sci. USA (1996) 93:4913-4918; Dubiley et al.. Nucleic Acids Research 
(1997), Vol. 25, 2259-2265; and U.S. Patent No. 5,552,270 by Khrapko et al Each of 
the foregoing is incorporated herein by reference. Gel pad arrays are the preferred 

25 positional arrays for use in the methods described herein. 

In some embodiments, a sample which contains a target analyte, e.g., a 
polynucleotide, such as a sample which contains genomic DNA, is loaded into a gel pad. 
An array of gel pads on a first solid support can be employed to perform an analysis on a 
plurality of samples, or a plurality of probes to detect a plurality of characteristics, e.g., 

30 SNPs, of a sample or samples. The genomic DNA is preferably digested, e.g., with a 

restriction enzyme, to provide shorter fragments of DNA which can easily diffuse into the 
gel pad(s). The gel pad composition and/or the size of fragments can be selected to 
permit the target polynucleotides to diffuse into the gel pad, and/or to prevent larger 
pieces of, e.g., genomic DNA from diffusing into the gel pad. The volume of the gel 

35 pad(s) is preferably less than about 1 microliter, more preferably less than about 500, 
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100, 50, 10, 5, 1, .5, or 0.1 nanoliters per gel pad. Volumes in this range permit the 
diffusion of reactants and target to occur in a conveniently short time period (e.g., 
preferably less than 5, 2, 1, 0.5, or 0.1 minutes). After the sample polynucleotide has 
diffused into the gel pad, the remaining sample can be washed away. 

5 An "array" can be any pattern of spaced-apart gel pads disposed on a substrate. 

Arrays can be conveniently provided in a grid pattern, but other pattems can also be used. 
In preferred embodiments, a gel pad array includes at least about 10 gel pads, more 
preferably at least about 50, 100, 500, 1000, 5000, or 10000 gel pads. In some 
embodiments, the array is an array of gel pads of substantially equal size, thickness, 

10 density, and the like, e.g., to ensure that each gel pad behaves consistently when 

contacted with a test mixture. In certain embodiments, however, the pads of a gel pad 
array can differ from one another; e.g., a mixed gel pad array can be constructed which 
includes more than one size or type of gel pad, e.g., gel pads made of different gel 
materials, or which entrap different species such as reagents or polynucleotide probes. In 

15 certain preferred embodiments, gel pads in an array are less than about 1 mm in diameter 
(or along a side, e.g., in the case of square gel pads), more preferably less than about 500 
microns, still more preferably less than about 100, 75, 50, 25, 10, 5, or 1 micron in 
diameter. 

A gel pad can have any convenient dimension for use in a particular assay. In 

20 preferred embodiments, a gel pad is thin enough, and porous enough, to permit rapid 
diffusion of at least certain reaction components into the gel pad when a solution or 
suspension is place din contact with the gel pad. For example, in one embodiment, a gel 
pad array for use in sequencing by hybridization permits polynucleotide fragments from a 
sample mixture to diffuse (within a conveniently short time period) into the gel pads and 

25 hybridize to oligonucleotide capture sequences disposed within the gel pads. In certain 
preferred embodiments, a gel pad (e.g., in an array of gel pads) has a thickness of at least 
about 1, 5, 10, 20, 30, 40, 50 or 100 microns. In certain preferred embodiments, a gel pad 
(e.g., in an array of gel pads) has a thickness of less than about 1 millimeter, 500 microns, 
200, 100, 50, 40, 30, 20, 10, 5, or 1 microns. 

30 In preferred embodiments, a first gel pad (or each the first array of gel pads) 

includes a first primer, e.g., a first PGR primer. The first primer is preferably 
complementary to at least a portion of the sample polynucleotide (or to its complement). 
The first primer is preferably immobilized in the first gel pad to prevent migration of the 
primer out of the gel pad. The immobilization can be permanent or reversible, and can be 

3 5 covalent or non-co valent. 
Example 1: 
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Multiplex PGR with two target-specific primers incorporating universal 
primer sequences, Type IIS restriction sites and generic capture tags, followed by 
capture of one or both target strands of the derivative nucleic acids 

Many pairs of target-specific PGR primers/adapters (Figure 1) are used for the 
multiplex amplification of many target sequences in a sample. Each of the PGR primers 
includes the following elements: 

1) Target-specific sequences, X, at the 3' ends (Xf and Xr in Figure 2); 

2) Two universal primer sequences (Uf and Ur) at the 5' ends that are common 
to all the pairs of PGR primers used in an assay; 

3) The sequence for a Type IIS restriction endonuclease (R); 

4) One or two generic capture tag sequences (Tl and T2) at internal positions in 
the primers and at specific positions relative to the Type IIS restriction 
endonuclease recognition sites, such that a double stranded DNA molecule 
containing these sequences will be cleaved adjacent to the tag sequence 
leaving the tag sequence in a single-stranded overhang. 

The protocol is illustrated in Figure 2. PGR is performed in two phases with the 
greatest amplification accomplished in the second phase by universal primers 
complementary to Uf and Ur. (see e.g., Brownie et al, Nucleic Acids Research 25 :3235, 
1997; U.S. patent 5,858,989, Jeffreys et al; Favis et al, Nature Biotechnology 18 :561, 
2000). 

The products are digested with a Type IIS restriction endonuclease to leave the 
tag sequences in single-stranded overhangs on the ends of the fi-agments, which products 
are called derivative nucleic acids. The generic tags are complementary to overhangs on 
partially duplex probes and hybridize to them. Ligation to the partially duplex probes 
followed by stringent washing results in the covalent capture of single-stranded DNA 
containing the sequence to be analyzed. Both strands of the target may be captured on 
separate array elements if Tl and T2 are different sequences. A polymorphism may be 
analyzed by primer extension, for example. Polymorphisms can be analyzed in both 
strands simultaneously on a set of positionally distinguishable partially duplex probes. 

Example 2: 
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Target-dependent ligation and PCR amplification of contiguous probes 
incorporating universal primer sequences, Type IIS restriction sites and generic 
capture tags, followed by capture of one or both strands of the derivative nucleic 
acids 

Many pairs of target-specific PCR probe/primers (Figure 3) are hybridized and 
ligated to different target sites in the nucleic acid sample. Each of the probe/primers 
includes the following elements (Figure 3): 

1) Target-specific sequences at the 3' end (X3' and X 5', respectively) which are 
complementary to contiguous sequences of the targets and which can be 
enzymatically ligated if, and only if, the probe sequences match the target 
sequence (e.g., the position marked "SNP") at the termini; 

2) Two universal primer sequences (Uf and Ur) that are common to all the pairs 
of probes used in an assay; 

3) The sequence for a Type IIS restriction endonuclease (R); 

4) A capture tag sequence (Tl or T2 on the 5' and 3' probe/primers, 
respectively) at internal positions and at specific positions relative to the Type 
IIS restriction endonuclease recognition site, such that a double stranded DNA 
molecule containing these sequences will be cleaved adjacent to the tag 
sequence leaving the tag sequence in a single-stranded overhang; 

5) An optional signal sequence situated between the restriction site and the 
target-specific sequence (not shown in Figure 3) that can encode information 
about the probe such as the identity of the nucleotide at the terminus of the 
probe. For the scoring of SNPs, one of four generic sequences would be 
present in the probes to encode one of the four possible bases. 

The protocol is illustrated in Figure 4. Ligated probe/primers are amplified by 
PCR using universal primer pairs complementary to Uf and Ur (common to all the 
circularizable probes used to analyze targets in a sample) to produce double stranded 
products (see e.g., Zhang, et al., Gene 211 :277 (1998); US Patent 5,876,924). 

The resulting amplification products are digested with a Type IIS restriction 
enzyme which leaves the tag sequences (initially positioned intemally in the 
circularizable probe) on single stranded overhangs on the products of the digestion. These 
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products, called derivative nucleic acids, are then captured and sorted on partially duplex 
probes. Both strands of the target may be captured on separate array elements if Tl and 
T2 are different sequences. A polymorphism may be analyzed by primer extension, for 
example. Polymorphisms can be analyzed in both strands simultaneously on a set of 
positionally distinguishable partially duplex probes. 

The presence of specific sequences in target nucleic acids in the sample may be 
deduced by the presence of derivative nucleic acids on specific partially duplex probes. 
Two alternative methods may be used (in this or other methods described herein) to 
analyze target sequences for specific nucleotides at polymorphic sites (SNPs): 

1) At least two different types of one of the probe/primers are prepared with 
different nucleotides at the site of Ugation (complementary to the site of a 
SNP) and with different signal sequences that encode the identity of the 
nucleotide. Only the probe molecules that are complementary to the target 
sequence will be Ugated and ampUfied (Zhang, et al.. Gene 211 :277 (1998); 
US Patent 5,876,924). All of the allele-specific probes for a SNP (i.e., two 
probes for a biallelic SNP) contain the same capture tag sequence, so all (both 
for a bialleUc SNP) of the resulting derivative nucleic acids are captured by 
the same partially duplex probe. The allele(s) of the polymorphism is 
determined by detection the signal sequence with generic hybridization probes 
complementary to the signal sequences in the derivative nucleic acids. For a 
biallelic SNPs two generic hybridization probes labeled with distinguishable 
fluorophors will reveal the allele status of the SNP. 

2) Probe/primers are prepared with degenerate nucleotides at the ligation site 
(complementary to the site of a SNP). Only the probe molecules that are 
complementary to the target sequence will be ligated and amplified. The 
resulting derivative nucleic acids are captured on partially duplex probes, and 
the allele status of the target is determined by a variety of means, such as 
primer extension reactions. 

Example 3: 
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Target-dependent ligation and amplification of circularizable probes 
containing universal primer sequences, Type IIS restriction sites and generic 
capture tags, followed by capture of one or both strands of the derivative nucleic 
acids 

Many different circularizable probes are hybridized and ligated to different target 
sites in the nucleic acid sample. Each of the circularizable probes includes the following 
elements (Figure 5): 

1) Target-specific sequences at the 3' and 5' ends (X3' and X 5', respectively) 
which are complementary to contiguous sequences of the targets and which 
can be enzymatically Ugated if, and only if, the probe sequences match the 
target sequence (e.g., the position marked "SNP") at the termini; 

2) Two universal primer sequences (Uf and Ur) that are common to all the 
circularizable probes used in an assay; 

3) The sequence for a Type IIS restriction endonuclease (R); 

4) One or two capture tag sequences (Tl and T2) at internal positions and at 
specific positions relative to the Type IIS restriction endonuclease recognition 
sites, such that a double stranded DNA molecule containing these sequences 
will be cleaved adjacent to the tag sequence leaving the tag sequence in a 
single-stranded overhang; 

5) An optional signal sequence (S) that can encode information about the 
probe such as the identity of the nucleotide at one of the termini of the 
probe. For the scoring of SNPs, one of four generic sequences would be 
present in the probes to encode one of the four possible bases, 

Ligated (circularized) probes are amplified using universal primers (common to 
all the circularizable probes used to analyze targets in a sample) to produce double 
stranded products (see e.g., Thomas et al, Arch Pathol Lab Med 123 :1170, 1999; Zhang, 
et al, Gene 21 1 :277 (1998); US Patent 5,876,924). The priming sites are situated in the 
circularizable probe so that the amplification is ligation dependent (see e.g., Thomas et 
al, Arch Pathol Lab Med 123 :1 170, 1999; Zhang, et al., Gene 211 :277 (1998); US Patent 
5,876,924), and so that the target dependent sequence, the type II S restriction site, the 
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capture tag sequence and the optional signal sequence are amplified. The direction in 
which the primers are extended is indicated by the arrows in Figure 5. 

The resulting multimer amplification products are digested with a Type IIS 
restriction enzyme which leaves the generic tag sequences (initially positioned internally 
in the circularizable probe) on single stranded overhangs on the products of the digestion. 
These products, called derivative nucleic acids, are then captured and sorted on partially 
duplex probes. 

The presence of specific sequences in target nucleic acids in the sample may be 
deduced by the presence of derivative nucleic acids on specific partially duplex probes. 
Two alternative methods may be used to analyze target sequences for specific nucleotides 
at polymorphic sites (SNPs): 

1) Separate circularizable probes are prepared with different nucleotides at the 
site of Ugation (complementary to the site of a SNP) and with different signal 
sequences that encode the identity of the nucleotide. Only the probe molecules 
that are complementary to the target sequence will be ligated and circularized. 
All of the allele-specific probes for a SNP (i.e., two probes for a biallelic 
SNP) contain the same capture tag sequence, so all (both for a biallelic SNP) 
of the resulting derivative nucleic acids are captured by the same partially 
duplex probe. The allele(s) of the polymorphism is determined by detection 
the signal sequence with generic hybridization probes complementary to the 
signal sequences in the derivative nucleic acids. For a biallelic SNPs two 
generic hybridization probes labeled with distinguishable fluorophors will 
reveal the allele status of the SNP. 

2) Probes are prepared with degenerate nucleotides at the ligation site 
(complementary to the site of a SNP). Only the probe molecules that are 
complementary to the target sequence will be ligated and circularized. The 
resulting derivative nucleic acids are captured on partially duplex probes, 
and the allele status of the target is determined by a variety of means, such 
as primer extension reactions. 
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Example 4: 

Specific capture and sorting of flap products from multiplexed Cleavase 
Invader assays on partially duplex arrays 

Many different pairs of Invader probes are hybridized to different target sites in the 
nucleic acid sample. Each pair consists of an invader oligonucleotide which forms a 
relatively stable hybrid with the target nucleic acid and a signal probe which hybridizes 
transiently to the target at a position adjacent to the invader probe (see e.g., Ryan et al., 
Mol Diagn 4 :135, 1999, Griffin et al., Proc Natl Acad Sci U S A 96 :6301, 1999). Many 
signal probes cycle on and off the target sequence, and they are cleaved by an 
endonuclease if and only if the sequence of the signal probe matches the target at the 
overlap between the signal probe and the invader probe. The signal probe includes the 
following elements (Figure 6): 

1) A3' region with sequence complementary to the target; 

2) An arbitrary capture tag sequence that is initially positioned internally and that 
includes at least one target-specific nucleotide that will be present on the 3' 
terminus of the expected cleavage product; 

3) A 5' region which can have an arbitrary signal sequence or a label moiety for 
detection of the product. 

The flap products (derivative oligonucleotides) resulting fi-om a multiplexed 
invader assay are captured and sorted by different partially duplex probes. The label on 
the 5' end of the flap can encode information about the probe that was cleaved such as 
the identity of the nucleotide in the overlap between the invader probe and the signal 
probe and hence the allele of a polymorphism. For a biallelic SNPs two generic 
hybridization probes complementary to the generic signal sequences and labeled with 
distinguishable fluorophors will reveal the allele status of the SNP. Alternatively, the 
generic hybridization probes used to detect the captured flaps (derivative 
oligonucleotides) can serve as generic primer sequences for rolling circle ampUfication 
with preformed circle. With this method of detection great signal ampUfication can be 
achieved. 
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